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(54) Title: MAMMALIAN TOXICOLOGICAL RESPONSE MARKERS 

^ (57) Abstract: The present invention relates to mammalian nucleic acid and protein molecules comprising a plurality of nucleic 
acid and protein molecules. The mammalian nucleic acid molecules can be used as hybridizable array elements in a microarray in 
^ diagnostic and therapeutic applications including detecting metabolic and toxicological responses, and in monitoring drug mecha- 
£>■ nism of action. The protein molecules can be used in a pharmaceutical composition. The present invention also relates to methods 
^ for screening compounds and therapeutics for metabolic responses indicative of a toxic compound or molecule. 
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MAMMALIAN TOXICOLOGICAL RESPONSE MARKERS 

This application is filed under the Patent Cooperation Treaty and claims the benefit of U.S. 
Nonprovisional Application No. 09/443,184, our Docket No. PC-0007 US, filed 19 th November, 1999. 

5 

TECHNICAL FIELD 

The present invention relates to mammalian nucleic acid and protein molecules, and methods for 
their use in diagnostic and therapeutic applications including detecting metabolic and toxicological 
responses, and in monitoring drug mechanism of actioa 

10 

BACKGROUND ART 

Toxicity testing is a mandatory and time-consuming part of drug development programs in the 
pharmaceutical industry. A more rapid screen to determine the effects upon metabolism and to detect 
toxicity of lead drug candidates may be the use of gene expression microarrays. For example, 

1 5 microarrays of various kinds may be produced using full length genes or gene fragments. These arrays 
can then be used to test samples treated with the drug candidates to elucidate the gene expression pattern 
associated with drug treatment. This gene pattern can be compared with gene expression pattens 
associated with compounds which produce known metabolic and toxicological responses. 

Benzo(a)pyrene is a known rodent and likely human carcinogen and is the prototype of a class of 

2 0 compounds, the polycyclic aromatic hydrocarbons (PAH). It is metabolized by several forms of 
cytochrome P450 (P450 isozymes) and associated enzymes to form both activated and detoxified 
metabolites. The ultimate metabolites are the bay-region diol epoxide, benzo(a)pyrene-7,8-diol-9, 1 0- 
epoxide (BPDE) and the K-region diol epoxide, 9-hydroxy benzo(a)pyrene-4,5 -oxide, both of which 
induce formation of DNA adducts. DNA adducts have been shown to persist in rat liver up to 56 days 

2 5 following treatment with benzo(a)pyrene at a dose of 10 mg/kg body weight three times per week for two 
weeks (Qu and Stacey (1996) Carcinogenesis 17:53-59). 

Acetaminophen is a widely-used analgesic. It is metabolized by specific cytochrome P450 
isozymes with the majority of the drug undergoing detoxification by glucuronic acid, sulfate and 
glutathione conjugation pathways. However, at supratherapeutic doses, acetaminophen is metabolized to 

30 an active intermediate, tf-acetyl-p-benzoquinone imine (NAPQI) which can cause hepatic and renal 
failure. NAPQI then binds to sulfhydryl groups of proteins causing their inactivation and leading to 
subsequent cell death (Kroger et aL (1997) Gen. Pharmacol. 28:257-263). 

Clofibrate is an hypolidemic drug which lowers elevated levels of serum triglycerides. In 
rodents, chronic treatment produces hepatomegaly and an increase in hepatic peroxisomes (peroxisome 
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proliferation). Peroxisome proliferators (PPs) are a class of drugs which activate the PP-activated 
receptor in rodent liver, leading to enzyme induction, stimulation of S-phase, and a suppression of 
apoptosis (Hasmall and Roberts (1999) Pharmacol. Then 82:63-70). PPs include the fibrate class of 
hypolidemic drugs, phenobarbitone, thiazolidinediones, certain non-steroidal anti-inflammatory drugs, 
5 and naturally-occuring fatty acid-derived molecules (Gelman et al (1999) Cell. Mol. Life Sci. 55:932- 
943). Clofibrate has been shown to increase levels of cytochrome P450 4A. It is also involved in 
transcription of P-oxidation genes as well as induction of PP-activated receptors (Kawashima et al. 
(1997) Arch. Biochem. Biophys. 347: 148-154). Peroxisome proliferation that is induced by both 
clofibrate and the chemically-related compound fenofibrate is mediated by a common inhibitory effect on 

10 mitochondrial membrane depolarization (Zhou and Wallace (1999) Toxicol. Sci. 48:82-89). 

Toxicological effects in the liver are also induced by other compounds. These can include 
carbon tetrachloride (a necrotic agent), hydrazine (a steatotic agent), a-naphthylisothiocyanate (a 
cholestatic agent), 4-acetylaminofluorene (a liver mitogen), and their corresponding metabolites, which 
are used in experimental protocols to measure toxicological responses (Waterfield et al. (1993) Arch. 

15 Toxicol. 67:244-254). 

The present invention provides mammalian nucleic acid and protein molecules, their use in 
diagnostic and therapeutic applications including detecting metabolic and toxicological responses, and in 
monitoring drug mechanism of action. 



2 0 DISCLOSURE OF INVENTION 

The invention provides a method for detecting or diagnosing the effect of a test compound or 
molecule associated with increased or decreased levels of nucleic acid molecules in a mammalian subject. 
The method comprises treating a mammalian subject with a known toxic compound or molecule which 
elicits a toxicological response, measuring levels of a plurality of nucleic acid molecules, selecting from 

25 the plurality of nucleic acid molecules those nucleic acid molecules that have levels modulated in 
samples treated with known toxic compounds or molecules when compared with untreated samples. 
Some of the levels may be upregulated by a toxic compound or molecule, others may be downregulated 
by a toxic compound or molecule, and still others may be upregulated with one known toxic compound or 
molecule and be downregulated with another known toxic compound or molecule. The selected nucleic 

30 acid molecules which are upregulated and downregulated by a known toxic compound or molecule are 
arrayed upon a substrate. The method further comprises measuring levels of nucleic acid molecules in 
the sample after the sample is treated with the toxic compound or molecule. Levels of nucleic acid 
molecules in a sample so treated are then compared with the plurality of the arrayed nucleic acid 
molecules to identify which sample nucleic acid molecules are upregulated and downregulated by the test 
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compound or molecule. In one embodiment, the nucleic acid molecules are hybridizable array elements 
of a microarray. 

Preferably, the comparing comprises contacting the arrayed nucleic acid molecules with the 
sample nucleic acid molecules under conditions effective to form hybridization complexes between the 
5 arrayed nucleic acid molecules and the sample nucleic acid molecules; and detecting the presence or 

absence of the hybridization complexes. In this context, similarity may mean that at least 1, preferably at 
least 5, more preferably at least 10, of the upregulated arrayed nucleic acid molecules form hybridization 
complexes with the sample nucleic acid molecules at least once during a time course to a greater extent 
than would the probes derived from a sample not treated with the test compound or molecule or a known 

10 toxic compound or molecule. Similarity may also mean that at least 1 , preferably at least 5, more 

preferably at least 10, of the downregulated arrayed nucleic acid molecules form hybridization complexes 
with the sample nucleic acid molecules at least once during a time course to a lesser extent than would 
the sample nucleic acid molecules of a sample not treated with the test compound or a known toxic 
compound. In one aspect, the arrayed nucleic acid molecules comprise SEQ ID NOs: 1-47 or fragments 

15 thereof. 

Preferred toxic compounds are selected from the group consisting of hypolipidemic drugs, n- 
alkylcarboxylic acids, n-alkylcarboxylic acid precursors, azole antifungal compounds, leukotriene D4 
antagonists, herbicides, pesticides, phthalate esters, phenyl acetate, dehydroepiandrosterone (DHEA), 
oleic acid, methanol and their corresponding metabolites, acetaminophen and its corresponding 

2 0 metabolites, benzo(a)pyrene, 3-methylcholanthrene, benz(a)anthracene, 7,12-dimethylbenz(a)anthracene, 
their corresponding metabolites, and the like, carbon tetrachloride, hydrazine, a-naphthylisothiocyanate, 
4-acetylaminofluorene, and their corresponding metabolites. Preferred tissues are selected from the 
group consisting of liver, kidney, brain, spleen, pancreas and lung. 

The arrayed nucleic acid molecules comprise fragments of messenger RNA transcripts of genes 

25 that are upregulated-or-downregulated at least 2-fold, preferably at least 2.5-fold, more preferably at least 
3-fold, in tissues treated with known toxic compounds when compared with untreated tissues. Preferred 
arrayed nucleic acid molecules are selected from the group consisting of SEQ ID NOs: 1-47 or fragments 
thereof, some of whose expression is upregulated following treatment with a toxic compound or molecule 
and others of whose expression is downregulated following treatment with a toxic compound or 

30 molecule. More preferable are SEQ ID NOs:2, 4, 6, 8, 9, and 1 1 which are upregulated following 

treatment with a toxic compound or molecule, and SEQ ID NOs:l, 4, and 7 which are downregulated 
following treatment with a toxic compound or molecule. 

The invention also provides a method comprising measuring levels of nucleic acid molecules in a 
sample after the sample is treated with a test compound or molecule. Levels of nucleic acid molecules in 
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a sample so treated are then compared with the plurality of the arrayed nucleic acid molecules to identify 
which sample nucleic acid molecules are upregulated and downregulated by the test compound or 
molecule. In one embodiment, the nucleic acid molecules are hybridizable array elements of a 
microarray. 

5 Alternatively, the invention provides methods for screening a sample for a metabolic response to 

a test compound or molecule. 

Alternatively, the invention provides methods for screening a test compound or molecule for a 
previously unknown metabolic response. 

In another aspect, the invention provides methods for preventing a toxicological response by 
10 administering complementary nucleotide molecules against one or more selected upregulated nucleic acid 
molecules or a ribozyme that specifically cleaves such molecules. Alternatively, a toxicological response 
may be prevented by administering sense nucleotide molecules for one or more selected downregulated 
nucleic acid molecules. 

In yet another aspect, the invention provides methods for preventing a toxicological response by 
15 administering an agonist which initiates transcription of a gene comprising a downregulated nucleic acid 
molecule of the invention. Alternatively, a toxicological response may be prevented by administering an 
antagonist which prevents transcription of a gene comprising an upregulated nucleic acid molecule of the 
invention. 

In another aspect, the invention provides nucleic acid molecules whose transcript levels are 
20 modulated in a sample during a metabolic response to a toxic compound or molecule. The invention also 
provides nucleic acid molecules whose transcript levels are upregulated in a sample during a metabolic 
response to a toxic compound or molecule. The invention also provides nucleic acid molecules whose 
transcript levels are downregulated in a sample during a metabolic response to a toxic compound or 
molecule. Upregulation or downregulation is at least 2-fold, more preferably at least 2.5-fold, even more 
25 preferably at least 3-fold. The metabolic response to a toxic compound or molecule may be a 

toxicological response. The invention also provides mammalian nucleic acid molecules which are 
homologous to the upregulated and downregulated nucleic acid molecules. In one aspect, preferred 
arrayed nucleic acid molecules are selected from the group consisting of SEQ ID NOs:l-47, or fragments 
thereof. 

3 0 The invention also provides a method for using a molecule selected from SEQ ID NOs: 1-59 or a 

portion thereof to screen a library of molecules to identify at least one ligand which specifically binds the 
selected molecule, the method comprising combining the selected molecule with the library of molecules 
under conditions allowing specific binding, and detecting specific binding, thereby identifying a ligand 
which specifically binds the selected molecule. 
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Such libraries include DNA and RNA molecules, peptides, peptide nucleic acids, agonists, 
antagonists, antibodies, immunoglobulins, drug compounds, pharmaceutical agents, and other ligands. In 
one aspect, the ligand identified using the method modulates the activity of the selected molecule. In an 
analogous method, the selected molecule or a portion thereof is used to purify a ligand. The method 
5 involves combining the selected molecule or a portion thereof with a sample under conditions to allow 
specific binding, detecting specific binding between the selected molecule and ligand, recovering the 
bound selected molecule, and separating the selected molecule from the ligand to obtain purified ligand. 
The invention further provides a method for using at least a portion of the proteins encoded by SEQ ID 
NOs:l-47 and the proteins of SEQ ID NOs:48-59 to produce antibodies. 

10 The invention further provides a method for inserting a marker gene into the genomic DNA of an 

animal to disrupt the expression of the natural nucleic acid molecule. The invention also provides a 
method for using the nucleic acid molecule to produce an animal model system, the method comprising , 
constructing a vector containing the nucleic acid molecule; introducing the vector into a totipotent 
embryonic stem cell; selecting an embryonic stem cell with the vector integrated into genomic DNA; 

15 microinjecting the selected cell into a blastocyst, thereby forming a chimeric blastocyst; transferring the 
chimeric blastocyst into a pseudopregnant dam, wherein the dam gives birth to a chimeric animal 
containing at least one additional copy of nucleic acid molecule in its germ line; and breeding the 
chimeric animal to generate a homozygous animal model system. 

The invention also provides a substantially purified mammalian protein or a portion thereof. The 

20 invention further provides isolated and purified proteins encoded by the nucleic acid molecules of SEQ 
ID NOs: 1-1 1, 17-33, 36, 39, and 41 . The invention further provides isolated and purified protein 
molecule of SEQ ID NOs:50 and 53. Additionally, the invention provides a pharmaceutical composition 
comprising a substantially purified mammalian protein or a portion thereof in conjunction with a 
pharmaceutical carrier. 

25 The invention further provides an isolated and purified mammalian nucleic acid molecule variant 

having at least 10% nucleic acid sequence identity to the mammalian nucleic acid molecule selected from 
SEQ ID NO: 1-47 and fragments thereof. The invention also provides an isolated and purified nucleic 
acid molecule having a sequence which is complementary to the mammalian nucleic acid molecule 
comprising a nucleic acid molecule selected from SEQ ID NO: 1-47 and fragments thereof. 

3 0 The invention further provides an expression vector containing at least a fragment of the 

mammalian nucleic acid molecule selected from the group consisting of SEQ ID NOs: 1-47. In another 
aspect, the expression vector is contained within a host cell. 

The invention also provides a method for producing a mammalian protein, the method 
comprising the steps of: (a) culturing the host cell containing an expression vector containing a 
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mammalian nucleic acid molecule of the invention under conditions suitable for the expression of the 
polypeptide; and (b) recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a substantially purified 
mammalian protein encoded by SEQ ID NOs:l-l 1, 17-33, 36, 39, and 41 and the amino acid sequence of 
5 SEQ ID NOs:50 and 53 and fragments thereof, in conjunction with a suitable pharmaceutical carrier. 

The invention further includes an isolated and purified antibody which binds to a mammalian 
protein encoded by SEQ ID NOs: 1-1 1, 17-33, 36, 39, and 41 and mammalian protein of SEQ ID NOs:50 
and 53 or fragments thereof. The invention also provides a purified agonist and a purified antagonist. 

10 BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

A portion of the disclosure of this patent document contains material which is subject to 

copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of 

the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file 

or records, but otherwise reserves all copyright rights whatsoever. 
15 The Sequence Listing contains the nucleic acid sequence of exemplary mammalian nucleic acid 

molecules of the invention, SEQ ID NOs:l-47, 60-135, 137, and 138; the protein sequence of exemplary 

mammalian protein molecules of the invention, SEQ ID NOs:48-59 and 136. 

MODES FOR CARRYING OUT THE INVENTION 

20 Definitions 

"Sample" is used in its broadest sense. A sample containing nucleic acid molecules may 
comprise a bodily fluid; a cell; an extract from a cell, chromosome, organelle, or membrane isolated from 
a cell; genomic DNA, RNA, or cDNA in solution or bound to a substrate; a biological tissue or biopsy 
thereof; a fingerprint or tissue print; natural or synthetic fibres; in a solution; in a liquid suspension; in a 
25 gaseous suspension; in an aerosol; and the like. 

"Plurality" refers preferably to a group of one or more members, preferably to a group of at least 
about 10, and more preferably to a group of at least about 100 members, and even more preferably a 
group of 10,000 members. 

"Substrate" refers to a rigid or semi-rigid support to which nucleic acid molecules or proteins are 
30 bound and includes membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, 
capillaries or other tubing, plates, polymers, and microparticles with a variety of surface forms including 
wells, trenches, pins, channels and pores. 

"Modulates" refers to a change in activity (biological, chemical, or immunological) or lifespan 
resulting from specific binding between a molecule and either a nucleic acid molecule or a protein. 
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"Microarray" refers to an ordered arrangement of hybridizable array elements on a substrate. The 
array elements are arranged so that there are preferably at least ten or more different array elements, more 
preferably at least 100 array elements, even more preferably at least 1000 array elements, and most 
preferably 10,000. Furthermore, the hybridization signal from each of the array elements is individually 
5 distinguishable. In a preferred embodiment, the array elements comprise nucleic acid molecules. 

"Nucleic acid molecule" refers to a nucleic acid, oligonucleotide, nucleotide, polynucleotide or 
any fragment thereof. It may be DNA or RNA of genomic or synthetic origin, double-stranded or 
single-stranded, and combined with carbohydrate, lipids, protein, or other materials to perform a 
particular activity such as transformation or form a useful composition such as a peptide nucleic acid 
10 (PNA). "Oligonucleotide" is substantially equivalent to the terms amplimer, primer, oligomer, element, 
target, and probe and is preferably single stranded. 

"Protein" refers to an amino acid sequence, oligopeptide, peptide, polypeptide, or portions 
thereof whether naturally occurring or synthetic. Exemplary portions are the first twenty consecutive 
amino acids of a mammalian protein encoded by SEQ ED NOs:l-l 1, 17-33, 36, 39, and 41 and 
15 mammalian protein of SEQ ID NOs:50 and 53. 

"Up-regulated" refers to a nucleic acid molecule whose levels increased in a treated sample 
compared with the nucleic acid molecule in an untreated sample. 

"Down-regulated" refers to nucleic acid molecule whose levels decreased in a treated sample 
compared with the nucleic acid molecule in an untreated sample. 
20 "Toxic compound" or "toxic agent" is any compound, molecule, or agent that elicits a 

biochemical, metabolic, and physiological response in an individual or animal, such as i) DNA damage, 
ii) cell damage, iii) organ damage or cell death, or iv) clinical morbidity or mortality. 

"Toxicological response" refers to a biochemical, metabolic, and physiological response in an 
individual or animal which has been exposed to a toxic compound or agent. 
2 5 "Fragment" refers to an Incyte clone or any part of a molecule which retains a usable, functional 

characteristic. Useful fragments include oligonucleotides and polynucleotides which may be used in 
hybridization or amplification technologies or in regulation of replication, transcription or translation. 
Exemplary fragments are the first sixty consecutive nucleotides of SEQ ID NOs:l-47. Useful fragments 
also include polypeptides and protein molecules which have antigenic potential and which may be used 
30 with a suitable pharmaceutical carrier in a pharmaceutical composition. Exemplary fragments are the 
first twenty consecutive amino acids of a mammalian protein encoded by SEQ ID NOs: 1-1 1, 17-33, 36, 
39, and 41 and mammalian protein of SEQ ID NOs:50 and 53. 

"Hybridization complex" refers to a complex between two nucleic acid molecules by virtue of the 
formation of hydrogen bonds between purines and pyrimidines. 
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"Ligand" refers to any compound, molecule, or agent which will bind specifically to a 
complementary site on a nucleic acid molecule or protein. Such ligands stabilize or modulate the activity 
of nucleic acid molecules or proteins of the invention and may be composed of at least one of the 
following: inorganic and organic substances including nucleic acids, proteins, carbohydrates, fats, and 
5 lipids. 

"Percent identity" or "% identity" refers to the percentage of sequence similarity found in a 
comparison of two or more amino acid or nucleic acid sequences. Percent identity can be determined 
electronically, e.g., by using the MEGALIGN program (DNASTAR, Madison WI) which creates 
alignments between two or more sequences according to methods selected by the user, e.g., the clustal 

10 method. (See, e.g., Higgins, D.G. and P.M. Sharp (1988) Gene 73:237-244.) The clustal algorithm 
groups sequences into clusters by examining the distances between all pairs. The clusters are aligned 
pairwise and then in groups. The percentage similarity between two amino acid sequences, e.g., 
sequence A and sequence B, is calculated by dividing the length of sequence A, minus the number of gap 
residues in sequence A, minus the number of gap residues in sequence B, into the sum of the residue 

15 matches between sequence A and sequence B, times one hundred. Gaps of low or of no similarity 

between the two amino acid sequences are not included in determining percentage similarity. Percent 
identity between nucleic acid sequences can also be counted or calculated by other methods known in the 
art, e.g., the Jotun Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 183:626-645.) Identity 
between sequences can also be determined by other methods known in the art, e.g., by varying 

2 0 hybridization conditions. 

"Substantially purified" refers to nucleic acid molecules or proteins that are removed from their 
natural environment and are isolated or separated, and are at least about 60% free, preferably about 75% 
free, and most preferably about 90% free, from other components with which they are naturally 
associated. 

25 

The Invention 

The present invention provides mammalian nucleic acid and protein molecules and method of 
using the nucleic acid molecules for screening test compounds and molecules for toxicological responses. 
Additionally the invention provides methods for characterizing the toxicological responses of a sample 
30 to a test compound or molecule. In particular, the present invention provides a composition comprising a 
plurality of nucleic acid molecules derived from human cDNA libraries, monkey cDNA libraries, mouse 
cDNA libraries, normal rat liver cDNA libraries, normalized rat liver cDNA libraries, prehybridized rat 
liver cDNA libraries, subtracted rat liver cDNA libraries, and rat kidney cDNA libraries. The nucleic 
acid molecules have been further selected for exhibiting upregulated or downregulated gene expression 



8 



WO 01/36684 



PCT/US00/31743 



in rat livers when the rats have been exposed to a known hepatotoxin, including a peroxisomal 
proliferator (PP), acetaminophen or one of its corresponding metabolites, a polycyclic aromatic 
hydrocarbon (PAH), carbon tetrachloride, hydrazine, a-naphthylisothiocyanate, 4-acetylaminofluorene, 
and their corresponding metabolites. 
5 PPs include hypolipidemic drugs, such as clofibrate, fenofibrate, clofenic acid, nafenopin, 

gemfibrozil, ciprofibrate, bezafibrate, halofenate, simfibrate, benzofibrate, etofibrate, WY- 14,643, and 
the like; n-alkylcarboxylic acids, such as trichloroacetic acid, valproic acid, hexanoic acid, and the like; 
n-alkylcarboxylic acid precursors, such as trichloroethylene, etrachloroethylene, and the like; azole 
antifungal compounds, such as bifonazole, and the like; leukotriene D4 antagonists; herbicides; 
10 pesticides; phthalate esters, such as di-[2-ethylhexyl] phthalate, mono-[2-ethylhexyl] phthalate, and the 
like; and natural chemicals, such as phenyl acetate, dehydroepiandrosterone (DHEA), oleic acid, 
methanol, and the like. In a preferred embodiment the toxin is clofibrate, or one of its corresponding 
metabolites. In another prefered embodiment the toxin is fenofibrate, or one of its corresponding 
metabolites. 

15 PAHs include compounds such as benzo(a)pyrene, 3-methylcholanthrene, benz(a)anthracene, 

7,12-dimethylbenz(a)anthracene, their corresponding metabolites, and the like. In a preferred 
embodiment the toxin is benzo(a)pyrene, or one of its corresponding metabolites. 

SEQ ID NOs: 1-16 were identified by their pattern of at least two-fold upregulation or 
downregulation following hybridization with sample nucleic acid molecules from rat liver tissue treated 

20 with a known toxic compound. SEQ ID NOs: 17-47 were identified by their homology to the sample 
nucleic acid molecules from rat liver tissue treated with a known toxic compound. These and other 
nucleic acid molecules can be immobilized on a substrate as hybridizable array elements in a microarray 
format. The microarray may be used to characterize gene expression patterns associated with novel 
compounds to elucidate any toxicological responses or to monitor the effects of treatments during clinical 

25 trials or therapy where metabolic responses to toxic compounds may be expected. 

When the nucleic acid molecules are employed as hybridizable array elements in a microarray, 
the array elements are organized in an ordered fashion so that each element is present at a specified 
location on the substrate. Because the array elements are at specified locations on the substrate, the 
hybridization patterns and intensities (which together create a unique expression profile) can be 

30 interpreted in terms of expression levels of particular genes and can be correlated with a toxicological 
response associated with a test compound or molecule. 

The invention also provides a substantially purified and isolated mammalian protein comprising 
the protein molecule of SEQ ID NOs:50 and 53 or portion thereof. The invention further provides 
isolated and purified proteins encoded by the nucleic acid molecules of SEQ ID NOs: 1-1 1, 17-33, 36, 39, 
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and 41, or portion thereof. 

Furthermore, the present invention provides methods for screening test compounds or 
therapeutics for potential toxicological responses and for screening a sample's toxicological response to a 
particular test compound or molecule. Briefly, these methods entail treating a sample with the test 
5 compound or molecule to elicit a change in gene expression patterns comprising the expression of a 
plurality of sample nucleic acid molecules. Nucleic acid molecules are selected by identifying those 
genes in rat liver or kidney that are upregulated-or-downregulated at least 2-fold, more preferably at least 
2.5-fold, most preferably at least 3-fold, when treated with a known toxic compound or molecule. The 
nucleic acid molecules are arrayed on a substrate. Then, the arrayed nucleic acid molecules and sample 

10 nucleic acid molecules are combined under conditions effective to form hybridization complexes which 
may be detected by methods well known in the art. Detection of higher or lower levels of such 
hybridization complexes compared with hybridization complexes derived from untreated samples and 
samples treated with a compound that is known not to induce a toxicological response correlates with a 
toxicological response of a test compound or a toxicological response to a molecule. 

15 Complementary DNA libraries 

Molecules are identified that reflect all or most of the genes that are expressed in rat liver or 
kidney. Molecules may be identified by isolating clones derived from several types of rat cDNA 
libraries, including normal rat cDNA libraries, normalized rat cDNA libraries, prehybridized rat cDNA 
libraries, and subtracted cDNA libraries. Clone inserts derived from these clones may be partially 

20 sequenced to generate expressed sequence tags (ESTs). Molecules are also identified by comparing the 
clones from rat cDNA libraries with clones from human, monkey, and mouse cDNA libraries using 
computer software nucleic acid comparison programs such as BLAST (see, e.g., Altschul, S.F. (1993) J. 
Mol. Evol. 3:290-300; Altschul, eiaL (1990) J. Mol. Biol. 215:403-410). 

In one embodiment, two collections of ESTs are identified and sequenced. A first collection of 

25 ESTs (the originator molecules) are derived from rat liver and kidney and are derived from the cDNA 
libraries presented in the Examples. A second collection includes ESTs derived from other rat cDNA 
libraries available in the ZOOSEQ database (Incyte Pharmaceuticals, Inc. Palo Alto CA). 

The two collections of ESTs are clustered electronically to form master clusters of ESTs. Master 
clusters are formed by identifying overlapping EST molecules and assembling these ESTs. A nucleic 

3 0 acid fragment assembly tool, such as the Phrap tool (Phil Green, University of Washington) and the 

GELVIEW fragment assembly system (GCG, Madison WI), can be used for this purpose. The minimum 
number of clones which constitute a cluster is two. In another embodiment, a collection of human genes 
known to be expressed in response to toxic agents are used to select representative ESTs from the 113 rat 
cDNA libraries. The master cluster process is repeated for these molecules. 
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After assembling the clustered consensus nucleic acid sequences, a representative 5' clone is 
nominated from each master cluster. The most 5 1 clone is preferred because it is most likely to contain 
the complete gene. The nomination process is described in greater detail in "Relational Database and 
System for Storing Information Relating to Biomolecular Sequences and Reagents", USSN 09/034,807, 
5 filed March 4, 1998, herein incorporated in its entirety by reference. The EST molecules are used as 
array elements on a microarray. 
Selection of arrayed nucleic acid molecules 

Samples are treated, preferably at subchronic doses, with one or more known toxic compounds 
over a defined time course. Preferably, the agents are peroxisomal proliferators (PPs), acetaminophen or 
10 one of its corresponding metabolites, polycyclic aromatic hydrocarbons (PAHs), carbon tetrachloride, 
hydrazine, a-naphthylisothiocyanate, 4-acetylaminofluorene, or their corresponding metabolites. 

The gene expression patterns derived from such treated biological samples can be compared with 
the gene expression patterns derived from untreated biological samples to identify and select nucleic acid 
molecules whose expression is either upregulated or downregulated due to the response to the toxic 
15 compounds. These selected molecules may then be employed as array elements alone or in combination 
with other array element molecules. Such a microarray is particularly useful to detect and characterize 
gene expression patterns associated with known toxic compounds. Such gene expression patterns can 
then be used for comparison to identify other compounds which also elicit a toxicological response. 

The arrayed nucleic acid molecules can be manipulated to optimize their performance in 
20 hybridization. To optimize hybridization, the arrayed nucleic acid molecules are examined using a 

computer algorithm to identify portions of genes without potential secondary structure. Such computer 
algorithms are well known in the art and are part of OLIGO 4.06 primer analysis software (National 
Biosciences, Plymouth MN) or LASERGENE software (DNASTAR, Madison WI). These programs can 
search within nucleic acid sequences to identify stem loop structures and tandem repeats and to analyze G 
25 + C content of the sequence (those molecules with a G + C content greater than 60% are excluded). 

Alternatively, the arrayed nucleic acid molecules can be optimized by trial and error. Experiments can be 
performed to determine whether sample nucleic acid molecules and complementary arrayed nucleic acid 
molecules hybridize optimally under experimental conditions. 

The arrayed nucleic acid molecules can be any RNA-like or DNA-like material, such as mRNAs, 
3 0 cDNAs, genomic DNA, peptide nucleic acids, branched DNAs and the like. The arrayed nucleic acid 
molecules can be in sense or antisense orientations. 

In one embodiment, the arrayed nucleic acid molecules are cDNAs. The size of the DNA 
sequence of interest may vary, and is preferably from 50 to 10,000 nucleotides, more preferably from 150 
to 3,500 nucleotides. In a second embodiment, the nucleic acid molecules are vector DNAs. In this case 
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the size of the DNA sequence of interest, i.e., the insert sequence, may vary from about 50 to 10,000 
nucleotides, more preferably from about 150 to 3,500 nucleotides. 

The nucleic acid molecule sequences of the Sequence Listing have been prepared by current, 
state-of-the-art, automated methods and, as such, may contain occasional sequencing errors and 
5 unidentified nucleotides. Nucleotide analogues can be incorporated into the nucleic acid molecules by 
methods well known in the art. The only requirement is that the incorporated nucleotide analogues must 
serve to base pair with sample nucleic acid molecules. For example, certain guanine nucleotides can be 
substituted with hypoxanthine which base pairs with cytosine residues. However, these base pairs are 
less stable than those between guanine and cytosine. Alternatively, adenine nucleotides can be 
10 substituted with 2,6-diaminopurine which can form stronger base pairs than those between adenine and 
thymidine. Additionally, the nucleic acid molecules can include nucleotides that have been derivatized 
chemically or enzymatically. Typical modifications include derivatization with acyl, alkyl, aryl or amino 
groups. 

The nucleic acid molecules can be immobilized on a substrate via chemical bonding. 

15 Furthermore, the molecules do not have to be directly bound to the substrate, but rather can be bound to 
the substrate through a linker group. The linker groups are typically about 6 to 50 atoms long to provide 
exposure to the bound nucleic acid molecule. Preferred linker groups include ethylene glycol oligomers, 
diamines, diacids and the like. Reactive groups on the substrate surface react with one of the terminal 
portions of the linker to bind the linker to the substrate. The other terminal portion of the linker is then 

20 functionalized for binding the nucleic acid molecule. Preferred substrates are any suitable rigid or 

semirigid support, including membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic 
beads, gels, tubing, plates, polymers, microparticles and capillaries. The substrate can have a variety of 
surface forms, such as wells, trenches, pins, channels and pores, to which the arrayed nucleic acid 
molecules are bound. 

25 The samples can be any sample comprising sample nucleic acid molecules and obtained from any 

bodily fluid (blood, urine, saliva, phlegm, gastric juices, etc.), cultured cells, biopsies, or other tissue 
preparations. The samples can be derived from any species, but preferably from eukaryotic species, and 
more preferably from mammalian species such as rat and human. 

DNA or RNA can be isolated from the sample according to any of a number of methods well 

30 known to those of skill in the art. For example, methods of purification of nucleic acids are described in 
Tijssen, P. (1993) Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization With 
Nucleic Acid Probes. Part I. Theory and Nucleic Acid Preparation . Elsevier, New York, NY. In one 
preferred embodiment, total RNA is isolated using the TRIZOL total RNA isolation reagent (Life 
Technologies, Inc., Gaithersburg MD) and mRNA is isolated using oligo d(T) column chromatography or 
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glass beads. When sample nucleic acid molecules are amplified it is desirable to amplify the sample 
nucleic acid molecules and maintain the relative abundances of the original sample, including low 
abundance transcripts. RNA can be amplified in vitro , in situ , or in vivo (See Eberwine US Patent No. 
5,514,545). 

S It is also advantageous to include controls within the sample to assure that amplification and 

labeling procedures do not change the true distribution of nucleic acid molecules in a sample. For this 
purpose, a sample is spiked with an amount of a control nucleic acid molecule predetermined to be 
detectable upon hybridization to its complementary arrayed nucleic acid molecule and the composition of 
nucleic acid molecules includes reference nucleic acid molecules which specifically hybridize with the 
10 control arrayed nucleic acid molecules. After hybridization and processing, the hybridization signals 
obtained should reflect accurately the amounts of control arrayed nucleic acid molecules added to the 
sample. 

Prior to hybridization, it may be desirable to fragment the sample nucleic acid molecules. 
Fragmentation improves hybridization by minimizing secondary structure and cross-hybridization to 
15 other sample nucleic acid molecules in the sample or noncomplementary nucleic acid molecules. 
Fragmentation can be performed by mechanical or chemical means. 
Labeling 

The sample nucleic acid molecules may be labeled with one or more labeling moieties to allow 
for detection of hybridized arrayed/sample nucleic acid molecule complexes. The labeling moieties can 

20 include compositions that can be detected by spectroscopic, photochemical, biochemical, bioelectronic, 
immunochemical, electrical, optical or chemical means. The labeling moieties include radioisotopes, 
such as 32 P, 33 P or 35 S, chemiluminescent compounds, labeled binding proteins, heavy metal atoms, 
spectroscopic markers, such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass 
spectrometry tags, spin labels, electron transfer donors and acceptors, and the like. Preferred fluorescent 

25 markers include Cy3 and Cy5 fluorophores (Amersham Pharmacia Biotech, Piscataway NJ). 
Hybridization 

The nulceic acid molecule sequence of SEQ ID NOs:l-47 and fragments thereof can be used in 
various hybridization technologies for various purposes. Hybridization probes may be designed or 
derived from SEQ ID NOs: 1-47. Such probes may be made from a highly specific region such as the 5' 
3 0 regulatory region or from a conserved motif, and used in protocols to identify naturally occurring 

sequences encoding the mammalian protein, allelic variants, or related sequences, and should preferably 
have at least 50% sequence identity to any of the protein sequences. The hybridization probes of the 
subject invention may be DNA or RNA and may be derived from the sequence of SEQ ID NOs:l-47 or 
from genomic sequences including promoters, enhancers, and introns of the mammalian gene. 
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Hybridization or PCR probes may be produced using oligolabeling, nick translation, end-labeling, or PCR 
amplification in the presence of the labeled nucleotide. A vector containing the nucleic acid sequence 
may be used to produce an mRNA probe in vitro by addition of an RNA polymerase and labeled nucleic 
acid molecules. These procedures may be conducted using commercially available kits such as those 
5 provided by Amersham Pharmacia Biotech. 

The stringency of hybridization is determined by G+C content of the probe, salt concentration, 
and temperature. In particular, stringency can be increased by reducing the concentration of salt or 
raising the hybridization temperature. In solutions used for some membrane based hybridizations, 
additions of an organic solvent such as formamide allows the reaction to occur at a lower temperature. 

10 Hybridization can be performed at low stringency with buffers, such as 5 x SSC with 1% sodium dodecyl 
sulfate (SDS) at 60°C, which permits the formation of a hybridization complex between nucleotide 
sequences that contain some mismatches. Subsequent washes are performed at higher stringency with 
buffers such as 0.2 x SSC with 0.1% SDS at either 45°C (medium stringency) or 68°C (high stringency). 
At high stringency, hybridization complexes will remain stable only where the nucleic acid sequences are 

15 completely complementary. In some membrane-based hybridizations, preferably 35% or most preferably 
50%, formamide can be added to the hybridization solution to reduce the temperature at which 
hybridization is performed, and background signals can be reduced by the use of other detergents such as 
Sarkosyl or Triton X-100 and a blocking agent such as salmon sperm DNA. Selection of components and 
conditions for hybridization are well known to those skilled in the art and are reviewed in Ausubel 

2 0 (supra) and Sambrook et al . ( 1 989) Molecular Cloning. A Laboratory Manual , Cold Spring Harbor Press, 
Plainview NY. 

Hybridization specificity can be evaluated by comparing the hybridization of specificity-control 
nucleic acid molecules to specificity-control sample nucleic acid molecules that are added to a sample in 
a known amount. The specificity-control arrayed nucleic acid molecules may have one or more sequence 

25 mismatches compared with the corresponding arrayed nucleic acid molecules. In this manner, whether 
only complementary arrayed nucleic acid molecules are hybridizing to the sample nucleic acid molecules 
or whether mismatched hybrid duplexes are forming is determined. 

Hybridization reactions can be performed in absolute or differential hybridization formats. In the 
absolute hybridization format, nucleic acid molecules from one sample are hybridized to the molecules in 

30 a microarray format and signals detected after hybridization complex formation correlate to nucleic acid 
molecule levels in a sample. In the differential hybridization format, the differential expression of a set 
of genes in two biological samples is analyzed. For differential hybridization, nucleic acid molecules 
from both biological samples are prepared and labeled with different labeling moieties. A mixture of the 
two labeled nucleic acid molecules is added to a microarray. The microarray is then examined under 
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conditions in which the emissions from the two different labels are individually detectable. Molecules in 
the microarray that are hybridized to substantially equal numbers of nucleic acid molecules derived from 
both biological samples give a distinct combined fluorescence (Shalon et al. PCT publication 
WO95/35505). In a preferred embodiment, the labels are fluorescent markers with distinguishable 
5 emission spectra, such as Cy3 and Cy5 fluorophores. 

After hybridization, the microarray is washed to remove nonhybridized nucleic acid molecules 
and complex formation between the hybridizable array elements and the nucleic acid molecules is 
detected. Methods for detecting complex formation are well known to those skilled in the art. In a 
preferred embodiment, the nucleic acid molecules are labeled with a fluorescent label and measurement 

10 of levels and patterns of fluorescence indicative of complex formation is accomplished by fluorescence 
microscopy, preferably confocal fluorescence microscopy. 

In a differential hybridization experiment, nucleic acid molecules from two or more different 
biological samples are labeled with two or more different fluorescent labels with different emission 
wavelengths. Fluorescent signals are detected separately with different photomultipliers set to detect 

15 specific wavelengths. The relative abundances/expression levels of the nucleic acid molecules in two or 
more samples is obtained. 

Typically, microarray fluorescence intensities can be normalized to take into account variations 
in hybridization intensities when more than one microarray is used under similar test conditions. In a 
preferred embodiment, individual array ed-sample nucleic acid molecule complex hybridization intensities 

20 are normalized using the intensities derived from internal normalization controls contained on each 
microarray. 

The labeled sample emits specific wavelengths which are detected using a plurality of 
photomultipliers. The nucleic acid molecules whose relative abundance/expression levels are modulated 
by treatment of a sample with a known toxic compound can be used as hybridizable elements in a 

25 microarray. Such a microarray can be employed to identify expression profiles associated with particular 
toxicological responses. Then, a particular subset of these photomultipliers set to detect specific 
wavelengths. The relative expression levels of the arrayed nucleic acid molecules can be identified as to 
which arrayed nucleic acid molecule expression is modulated in response to a particular toxicological 
agent. These photomultipliers are set to detect specific wavelengths. The relative expression levels of 

30 the nucleic acid molecules can be employed to identify other compounds with a similar toxicological 
response. 

Alternatively, for some treatments with known side effects, the microarray, and expression 
patterns derived therefrom, is employed to prospectively define the treatment regimen. A dosage is 
established that minimizes expression patterns associated with undesirable side effects. This approach 
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may be more sensitive and rapid than waiting for the patient to show toxicological side effects before 
altering the course of treatment. 

Generally, the method for screening a library of test compounds or molecules to identify those 
with a toxicological response entails selecting a plurality of arrayed genes whose expression levels are 
5 modulated in tissues treated with known toxic compounds when compared with untreated tissues. Then a 
sample is treated with the test compound or molecule to induce a pattern of gene expression comprising 
the expression of a plurality of sample nucleic acid molecules. Tissues from a mammalian subject treated 
at various dosages of the test compound may be screened to determine which doses may be toxic. 

Then, the expression levels of the arrayed genes and the sample nucleic acid molecules are 

10 compared to identify those compounds that induce expression levels of the sample nucleic acid molecules 
that are similar to those of the arrayed genes. In one preferred embodiment, gene expression levels are 
compared by contacting the arrayed genes with the sample nucleic acid molecules under conditions 
effective to form hybridization complexes between arrayed genes and sample nucleic acid molecules; and 
detecting the presence or absence of the hybridization complexes. 

15 Similarity may mean that at least 1, preferably at least 5, more preferably at least 10, of the 

upregulated arrayed genes form hybridization complexes with the sample nucleic acid molecules at least 
once during a time course to a greater extent than would the nucleic acid molecules of a sample not 
treated with the test compound. Similarity may also mean that at least 1, preferably at least 5, more 
preferably at least 10, of the downregulated nucleic acid molecules form hybridization complexes with 

20 the arrayed genes at least once during a time course to a lesser extent than would the nucleic acid 
molecules of a sample not treated with the test compound. 

Such a similarity of expression patterns means that a toxicological response is associated with the 
compound or therapeutic tested. Preferably, the toxic compounds belong to the class of peroxisomal 
proliferators (PPs), including hypolipidemic drugs, such as clofibrate, fenofibrate, clofenic acid, 

25 nafenopin, gemfibrozil, ciprofibrate, bezafibrate, halofenate, simfibrate, benzofibrate, etofibrate, WY- 
14,643, and the like; n-alkylcarboxylic acids, such as trichloroacetic acid, valproic acid, hexanoic acid, 
and the like; n-alkylcarboxylic acid precursors, such as trichloroethylene, etrachloroethylene, and the 
like; azole antifungal compounds, such as bifonazole, and the like; leukotriene D4 antagonists; 
herbicides; pesticides; phthalate esters, such as di-[2-ethylhexyl] phthalate, mono-[2-ethylhexyl] 

30 phthalate, and the like; and natural chemicals, such as phenyl acetate, dehydroepiandrosterone (DHEA), 
oleic acid, methanol, and the like. In another embodiment, the toxic compound is acetaminophen or one 
of its corresponding metabolites. In yet another embodiment, the toxic compounds are polycyclic 
aromatic hydrocarbons (PAHs), including compounds such as benzo(a)pyrene, 3-methylcholanthrene, 
benz(a)anthracene, 7,12-dimethylbenz(a)anthracene, their corresponding metabolites, and the like. Of 



16 



WO 01/36684 



PCT/US00/31743 



particular interest is the study of the toxicological responses of these compounds on the liver, kidney, 

brain, spleen, pancreas, and lung. 

Modification of Gene Expression Using Nucleic Acids 

Gene expression may be modified by designing complementary or antisense molecules (DNA, 
5 RNA, or PNA) to the control, 5\ 3', or other regulatory regions of the mammalian gene. 

Oligonucleotides designed with reference to the transcription initiation site are preferred. Similarly, 
inhibition can be achieved using triple helix base-pairing which inhibits the binding of polymerases, 
transcription factors, or regulatory molecules (Gee et al. In: Huber and Carr (1994) Molecular and 
Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177). A complementary molecule 

10 may also be designed to block translation by preventing binding between ribosomes and mRNA. In one 
alternative, a library of nucleic acid molecules or fragments thereof may be screened to identify those 
which specifically bind a regulatory, nontranslated sequence . 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence- specific hybridization of the ribozyme 

15 molecule to complementary target RNA followed by endonucleolytic cleavage at sites such as GUA, 
GUU, and GUC. Once such sites are identified, an oligonucleotide with the same sequence may be 
evaluated for secondary structural features which would render the oligonucleotide inoperable. The 
suitability of candidate targets may also be evaluated by testing their hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

20 Complementary nucleic acids and ribozymes of the invention may be prepared via recombinant 

expression, in vitro or in vivo , or using solid phase phosphoramidite chemical synthesis. In addition, 
RNA molecules may be modified to increase intracellular stability and half-life by addition of flanking 
sequences at the 5* and/or 3' ends of the molecule or by the use of phosphorothioate or 2* O-methyl rather 
than phosphodiesterase linkages within the backbone of the molecule. Modification is inherent in the 

25 production of PNAs and can be extended to other nucleic acid molecules. Either the inclusion of 
nontraditional bases such as inosine, queosine, and wybutosine, and or the modification of adenine, 
cytidine, guanine, thymine, and uridine with acetyl-, methyl-, thio- groups renders the molecule less 
available to endogenous endonucleases. 
Screening Assays 

30 The nucleic acid molecule encoding the mammalian protein may be used to screen a library of 

molecules for specific binding affinity. The libraries may be DNA molecules, RNA molecules, PNAs, 
peptides, proteins such as transcription factors, enhancers, repressors, and other ligands which regulate 
the activity, replication, transcription, or translation of the nucleic acid molecule in the biological system. 
The assay involves combining the mammalian nucleic acid molecule or a fragment thereof with the 
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library of molecules under conditions allowing specific binding, and detecting specific binding to identify 
at least one molecule which specifically binds the nucleic acid molecule. 

Similarly the mammalian protein or a portion thereof may be used to screen libraries of 
molecules in any of a variety of screening assays. The portion of the protein employed in such screening 
5 may be free in solution, affixed to an abiotic or biotic substrate (e.g. borne on a cell surface), or located 
intracellularly. Specific binding between the protein and molecule may be measured. Depending on the 
kind of library being screened, the assay may be used to identify DNA, RNA, or PNA molecules, 
agonists, antagonists, antibodies, immunoglobulins, inhibitors, peptides, proteins, drugs, or any other 
ligand, which specifically binds the protein. One method for high throughput screening using very small 
10 assay volumes and very small amounts of test compound is described in USPN 5,876,946, incorporated 
herein by reference, which screens large numbers of molecules for enzyme inhibition or receptor binding. 
Purification of Ligand 

The nucleic acid molecule or a fragment thereof may be used to purify a ligand from a sample. A 

method for using a mammalian nucleic acid molecule or a fragment thereof to purify a ligand would 
15 involve combining the nucleic acid molecule or a fragment thereof with a sample under conditions to 

allow specific binding, detecting specific binding, recovering the bound protein, and using an appropriate 

agent to separate the nucleic acid molecule from the purified ligand. 

Similarly, the protein or a portion thereof may be used to purify a ligand from a sample. A 

method for using a mammalian protein or a portion thereof to purify a ligand would involve combining 
20 the protein or a portion thereof with a sample under conditions to allow specific binding, detecting 

specific binding between the protein and ligand, recovering the bound ligand, and using an appropriate 

chaotropic agent to separate the protein from the purified ligand. 

Pharmacology 

Pharmaceutical compositions are those substances wherein the active ingredients are contained in 
25 an effective amount to achieve a desired and intended purpose. The determination of an effective dose is 
well within the capability of those skilled in the art. For any compound, the therapeutically effective 
dose may be estimated initially either in cell culture assays or in animal models. The animal model is 
also used to achieve a desirable concentration range and route of administration. Such information may 
then be used to determine useful doses and routes for administration in humans. 
3 0 A therapeutically effective dose refers to that amount of protein or inhibitor which ameliorates 

the symptoms or condition. Therapeutic efficacy and toxicity of such agents may be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED 50 (the dose 
therapeutically effective in 50% of the population) and LD 50 (the dose lethal to 50% of the population). 
The dose ratio between toxic and therapeutic effects is the therapeutic index, and it may be expressed as 
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the ratio, LD 5O /ED 50 . Pharmaceutical compositions which exhibit large therapeutic indexes are preferred. 
The data obtained from cell culture assays and animal studies are used in formulating a range of dosage 
for human use. 

5 MODEL SYSTEMS 

Animal models may be used as bioassays where they exhibit a toxic response similar to that of 
humans and where exposure conditions are relevant to human exposures. Mammals are the most 
common models, and most toxicity studies are performed on rodents such as rats or mice because of low 
cost, availability, and abundant reference toxicology. Inbred or outbred rodent strains provide a 
10 convenient model for investigation of the physiological consequences of under- or over-expression of 

genes of interest and for the development of methods for diagnosis and treatment of diseases. A mammal 
inbred to over-express a particular gene, so that the protein is secreted in milk, may also serve as a 
convenient source of the protein expressed by that gene. 
Toxicology 

15 Toxicology is the study of the effects of test compounds, molecules, or toxic agents on living 

systems to identify adverse effects. The majority of toxicity studies are performed on rats or mice to help 
predict whether adverse effects of agents will occur in humans. Observation of qualitative and 
quantitative changes in physiology, behavior, homeostatic, developmental, and reproductive processes, 
and lethality are used to generate profiles of safe or toxic responses and to assess the consequences on 

20 human health following exposure to the agent. 

Toxicological tests measure the effects of a single, repeated, or long-term exposure of a subject to 
a substance. Substances may be tested for specific endpoints such as cytotoxicity, mutagenicity, 
carcinogenicity and teratogenicity. Degree of response varies according to the route of exposure (contact, 
ingestion, injection, or inhalation), age, sex, genetic makeup, and health status of the subject. Other tests 

25 establish the toxicokinetic and toxicodynamic properties of substances. Toxicokinetic studies trace the 
absorption, distribution in subject tissues, metabolism, storage, and excretion of substances. 
Toxicodynamic studies chart biological responses that are consequences of the presence of the substance 
in the subject tissues. 

Genetic toxicology identifies and analyzes the ability of an agent to produce damage at a cellular 
3 0 or subcellular level. Such genotoxic agents usually have common chemical or physical properties that 
facilitate interaction with nucleic acids and are most harmful when mutated chromosomes are passed 
along to progeny. Toxicological studies may identify agents that increase the frequency of structural or 
functional abnormalities in progeny if administered to either parent before conception, to the mother 
during pregnancy, or to the developing organism. Mice and rats are most frequently used in these tests 
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because of their short reproductive cycle which allows investigators to breed sufficient quantities of 
individual animals to satisfy statistical requirements. 

All types of toxicology studies on experimental animals involve preparation of a suitable form of 
the compound for administration, selection of the route of administration, and selection of a species 
5 which resembles the species of pharmacological interest. Dose concentrations of the compound are 
varied to identify, measure, and investigate a range of dose-related effects related to exposure. 

Acute toxicity tests are based on a single administration of the agent to the subject to determine 
the symptomology or lethality of the agent. Three experiments are conducted; an experiment to define 
the initial dose range; an experiment to narrow the range of effective doses; and a final experiment to 
10 establish the dose-response curve. 

Prolonged and subchronic toxicity tests are based on the repeated administration of the agent. 
Rat and dog are commonly used in these studies to provide data from species in different taxonomic 
orders. With the exception of carcinogenesis, there is considerable evidence that daily administration of 
an agent at high-dose concentrations for periods of three to four months will reveal most forms of toxicity 
15 in adult animals. 

Chronic toxicity tests, with a duration of a year or more, are used to demonstrate either the 
absence of toxicity or the carcinogenic potential of an agent. When studies are conducted on rats, a 
minimum of at least one test group plus one control group are used. Animals are quarantined, examined 
for health, and monitored at the outset and at intervals throughout the experiment. 
20 Transgenic Animal Models 

Transgenic rodents which over-express or under-express a gene of interest may be inbred and 
used to model human diseases or to test therapeutic or toxic agents. (See USPN 4,736,866; USPN 
5,175,383; and USPN 5,767,337; incorporated herein by reference). In some cases, the introduced gene 
may be activated at a specific time in a specific tissue type during fetal development or postnatally, 
25 Expression of the transgene is monitored by analysis of phenotype or tissue-specific mRNA expression, 
in transgenic animals before, during, and after being challenged with experimental drug therapies. 
Embryonic Stem Cells 

Embryonic stem cells (ES) isolated from rodent embryos retain the potential to form an embryo. 
When ES cells are placed inside a carrier embryo, they resume normal development and contribute to all 
30 tissues of the live-born animal. ES cells are the preferred cells used in the creation of experimental 

knockout and knockin rodent strains. Mouse ES cells, such as the mouse 129/SvJ cell line, are derived 
from the early mouse embryo and are grown under culture conditions well known in the art. Vectors for 
knockout strains contain a disease gene candidate modified to include a marker gene which disrupts 
transcription and/or translation of the endogenous disease candidate gene in vivo . The vector is 
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introduced into ES cells by transformation methods such as electroporation, liposome delivery, 
microinjection, and the like which are well known in the art. The endogenous rodent gene is replaced by 
the disrupted disease gene through homologous recombination and integration during cell division. 
Expression of the marker gene confers a selective advantage to the transformed cells when incubated with 
5 an otherwise toxic/lethal selecting agent. Transformed ES cells are selected, identified, and preferably 

microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. The blastocysts 
are surgically transferred to pseudopregnant dams and the resulting chimeric progeny are genotyped and 
bred to produce heterozygous or homozygous strains. 

ES cells are also used to study the differentiation of various cell types and tissues in vitro , such as 

10 neural cells, hematopoietic lineages, and cardiomyocytes (Bain et al. (1995) Dev. Biol. 168:342-357; 

Wiles and Keller (1991) Development 1 1 1 :259-267; and Klug et al. (1996) J. Clin. Invest. 98:216-224). 
Recent developments demonstrate that ES cells derived from human blastocysts may also be manipulated 
in vitro to differentiate into eight separate cell lineages, including endoderm, mesoderm, and ectodermal 
cell types (Thomson et al. (1998) Science 282:1 145-1 147). 

15 Knockout Analysis 

In gene knockout analysis, a region of a human disease gene candidate is enzymatically modified 
to include a non-mammalian gene such as the neomycin phosphotransferase gene (neo\ Capecchi (1989) 
Science 244:1288-1292). The inserted coding sequence disrupts transcription and translation of the 
targeted gene and prevents biochemical synthesis of the disease candidate protein. The modified gene is 

20 transformed into cultured embryonic stem cells (described above), the transformed cells are injected into 
rodent blastulae, and the blastulae are implanted into pseudopregnant dams. Transgenic progeny are 
crossbred to obtain homozygous inbred lines. 
Knockin Analysis 

Totipotent ES cells, present in the early stages of embryonic development, can be used to create 
25 knockin humanized animals (pigs) or transgenic animal models (mice or rats) of human diseases. With 
knockin technology, a region of a human gene is injected into animal ES cells, and the human sequence 
integrates into the animal cell genome by recombination. Totipotent ES cells which contain the 
integrated human gene are handled as described above. Inbred animals are studied and treated to obtain 
information on the analogous human condition. These methods have been used to model several human 
30 diseases. (See, e.g., Lee et al. (1998) Proc. Natl. Acad. Sci. 95:1 1371-1 1376; Baudoin et al. (1998) Genes 
Dev. 12:1202-1216; and Zhuang et al. (1998) Mol. Cell Biol. 18:3340-3349). 
Non-Human Primate Model 

The field of animal testing deals with data and methodology from basic sciences such as 
physiology, genetics, chemistry, pharmacology and statistics. These data are paramount in evaluating the 
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effects of therapeutic agents on non-human primates as they can be related to human health. Monkeys 
are used as human surrogates in vaccine and drug evaluations, and their responses are relevant to human 
exposures under similar conditions. Cynomolgus and Rhesus monkeys (Macaca fascicularis and Macaca 
mulatta . respectively) and Common Marmosets (Callithrix iacchus) are the most common non-human 
5 primates (NHPs) used in these investigations. Since great cost is associated with developing and 

maintaining a colony of NHPs, early research and toxicological studies are usually carried out in rodent 
models. In studies using behavioral measures such as drug addiction, NHPs are the first choice test 
animal. In addition, NHPs and individual humans exhibit differential sensitivities to many drugs and 
toxins and can be classified as a range of phenotypes from "extensive metabolizers" to "poor 
10 metabolizers" of these agents. 

In additional embodiments, the nucleic acid molecules which encode the mammalian protein may 
be used in any molecular biology techniques that have yet to be developed, provided the new techniques 
rely on properties of nucleic acid molecules that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

15 

Examples 

It is understood that this invention is not limited to the particular methodology, protocols, and 
reagents described, as these may vary. It is also understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is not intended to limit the scope of the present 
20 invention which will be limited only by the appended claims. The examples below are provided to best 
describe the subject invention and its representative constituents. 

I cDNA Library Construction 

The RALINOT01 cDNA library was constructed from liver tissue removed from a pool of fifty 
25 10- to 1 1-week-old Sprague-Dawley female rats (Pharmacon, Waverly PA). The animals were housed in 
standard laboratory caging and fed PMI-certified Rodent Diet #5002. The animals appeared to be in good 
health at the time tissue was harvested. The animals were anesthetized by C0 2 inhalation, and then 
cardiocentesis was performed. 

Frozen tissue was homogenized and lysed in TRIZOL reagent (1 g tissue/ 10 ml TRIZOL; Life 
30 Technologies), a monophasic solution of phenol and guanidine isothiocyanate, using a POLYTRON 
homogenizer (PT-3000; Brinkmann Instruments, Westbury NY). After a brief incubation on ice, 
chloroform (1 :5 v/v) was mixed with the reagent, and then centrifuged at 1,000 rpm. The upper aqueous 
layer was removed to a fresh tube, and the RNA precipitated with isopropanol, resuspcnded in DEPC- 
treated water, and treated with DNase I for 25 min at 37°C. The RNA was re-extracted once with 
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phenol-chloroform, pH 4.7, and precipitated using 0.3 M sodium acetate and 2.5 volumes ethanol. The 
mRNA was then isolated using an OLIGOTEX kit (QIAGEN, Chatsworth CA) and used to construct the 
cDNA library. 

The mRNA was handled according to the recommended protocols in the SUPERSCRIPT plasmid 
5 system (Life Technologies). The cDNAs were fractionated on a SEPHAROSE CL-4B column 

(Amersham Pharmacia Biotech), and those cDNAs exceeding 400 bp were Iigated into the pINCYl 
plasmid vector (Incyte Pharmaceuticals). The plasmid pINCYl was subsequently transformed into DH5a 
or DH10B competent cells (Life Technologies). 

The RAKINOT01 library was constructed using mRNA isolated from kidney tissue removed 
10 from a pool of fifty, 7- to 8-week-old male Sprague-Dawley rats, as described above. 

The RAKJNOT02 library was constructed using mRNA isolated from kidney tissue removed 
from a pool of fifty, 10- to 1 1-week-old female Sprague-Dawley rats, as described above. 

II cDNA Library Normalization 

15 In some cases, cDNA libraries were normalized in a single round according to the procedure of 

Soares et al (1994, Proc. Natl. Acad. Sci. 91:9228-9232) with the following modifications. The primer 
to template ratio in the primer extension reaction was increased from 2: 1 to 10: 1 . Reduction of each 
dNTP concentration in the reaction to I50//M allowed the generation of longer (400-1000 nucleotide 
(nt)) primer extension products. The reannealing hybridization was extended from 13 to 19 hours. The 

20 single stranded DNA circles of the normalized library were purified by hydroxy apatite chromatography, 
converted to partially double-stranded by random priming, and electroporated into DH10B competent 
bacteria (Life Technologies). 

The Soares normalization procedure is designed to reduce the initial variation in individual 
cDNA frequencies and to achieve abundances within one order of magnitude while maintaining the 

25 overall sequence complexity of the library. In the normalization process, the prevalence of high- 
abundance cDNA clones decreases significantly, clones with mid-level abundance are relatively 
unaffected, and clones for rare transcripts are increased in abundance. In the modified Soares 
normalization procedure, significantly longer hybridization times are used to increase gene discovery 
rates by biasing the normalized libraries toward low-abundance cDNAs that are well represented in a 

3 0 standard transcript image. 

The RALINON03, RALINON04, and RALINON07 normalized rat liver cDNA libraries were 
constructed with 2.0 x 10 6 , 4.6 x I0 5 , and 2.0 x 10 6 independent clones from the RALINOT01 cDNA 
library, respectively. The RALINOT01 cDNA library was normalized in one round using conditions 
adapted from Soares (supra) except that a significantly longer (48-hour) reannealing hybridization was 
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used. 

III cDNA Library Prehybridization 

The RALINOHO I cDNA library was constructed with clones from the RALINOT01 cDNA 
5 library. After preparation of the RALINOT01 cDNA library, 9,984 clones were spotted onto a nylon 
filter, lysed, and the plasmid DNA was bound to the filter. The filter was incubated with pre-warmed 
hybridization buffer and then hybridized at 42°C for 14-16 hours in 0.75 M NaCl, 0. 1 M 
Na 2 HP0 4 /NaH 2 P0 4 , 0.15 M tris-HCl (pH 7.5), 5x Denhardt's Solution, 2% SDS, 100 ug/ml sheared 
salmon sperm DNA, 50% formamide, and [ 32 P]-labeled oligonucleotide molecules made from reverse 

10 transcribed rat liver mRNA from an untreated animal. The filter was rinsed with 2 x SSC (saline sodium 
citrate) at ambient temperature for 5 minutes followed by washing for 30 minutes at 68°C with pre- 
warmed washing solution (2 x SSC, 1% SDS). The wash was repeated with fresh washing solution for an 
additional 30 minutes at 68°C. Filters were then washed twice with pre-warmed washing solution (0.6 x 
SSC, 1% SDS) for 30 minutes at 68°C. Some 4,224 clones had very low hybridization signals and about 

15 20% of the clones had no signals and two groups were isolated and sequenced. 

IV Isolation and Sequencing of cDNA Clones 

DNA was isolated using the following protocol. Single bacterial colonies were transferred into 
individual wells of 384-well plates (Genetix Ltd, Christchurch, United Kingdom) using sterile toothpicks. 

20 The wells contained 1 ml of sterile Terrific Broth (Life Technologies) with 25 mg/1 carbenicillin and 

0.4% glycerol (v/v). The plates were covered and placed in an incubator (Thermodyne, Newtown Square 
PA) at 37°C for 8-10 hours. Plasmid DNA was released from the cells and amplified using direct link 
PCR (Rao, V.B. (1994) Anal. Biochem. 216:1-14) as follows. The direct link PCR solution included 30 
ml of NUCLEIX PLUS PCR nucleotide mix (Amersham Pharmacia Biotech, Piscataway NJ) and 300 ul 

25 of Taq DNA polymerase (Amersham Pharmacia Biotech). Five microlitres of the PCR solution were 
added to each of the 384 wells using the MICROLAB 2200 system (Hamilton, Reno NV); plates were 
centrifuged at 1000 rpm for 20 seconds and refrigerated until use. A 384 pin tool (V&P Scientific Inc, 
San Diego CA) was used to transfer bacterial cells from the incubation plate into the plate containing the 
PCR solution where 0. 1 % Tween 20 caused the cells to undergo lysis and release the plasmid DNA. 

30 After lysis, the plates were centrifuged up to 500 rpm, covered with a cycle sealer, and cycled using a 

384-well DNA ENGINE thermal cycler (MJ Research, Watertown MA) using the program dPCR30 with 
the following parameters: Step 1) 95 °C, 1 minute; Step 2) 94 °C, 30 seconds; Step 3) 55 °C, 30 seconds; 
Step 4) 72 °C, 2 minutes; Step 5) steps 2, 3, and 4 repeated 29 times; Step 6) 72 °C, 10 minutes; and Step 
7) storage at4°C. 
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The concentration of DNA in each well was determined by dispensing 100 |il PICO GREEN 
quantitation reagent (0.25% (v/v), Molecular Probes, Eugene OR) dissolved in lx TE and 0.5 ul of 
undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, Acton MA), 
allowing the DNA to bind to the quantitation reagent. The plate was scanned in a Fluoroscan II 
5 (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantitate the 
concentration of DNA. Typical concentrations of each DNA sample were in the range of 100 to 500 
ng/ml. 

The cDNAs were prepared for sequencing using either a HYDRA microdispenser (Robbins 
Scientific, Sunnyvale CA) or MICROLAB 2200 system (Hamilton) in combination with the DNA 

10 ENGINE thermal cyclers (MJ Research). The cDNAs were sequenced using the method of Sanger, F. 
and A.R. Coulson (J. Mol. Biol. (1975) 94:441-448) and the ABI 377 sequencing systems (PE 
Biosystems). Most of the isolates were sequenced according to standard ABI protocols using ABI kits 
(PE Biosystems). The solution volumes were used at 0.25x - l.Ox concentrations. Typically, 500 to 700 
base pairs were sequenced in 3.5 to 4 hours. In the alternative, cDNAs may have been sequenced using 

15 solutions and dyes from Amersham Pharmacia Biotech. 

V Rat Liver and Kidney Gene Selection 

As a first step, originator molecules from high throughput sequencing experiments were derived 
from clone inserts from RALINOT01, RAKINOT01, RAKINOT02, RALINOH01, RALINON03, 
20 RALINON04 and RALINON07. cDNA library clones were obtained. There were 18,140 rat liver 
molecules and 5,779 rat kidney molecules. 

Additionally, 1,500 rat molecules derived from clone inserts of any of 1 13 rat cDNA libraries 
were selected based on their homology to genes coding for polypeptides implicated in toxicological 
responses including peroxisome-associated genes, lysosome-associated genes, apoptosis-associated 
25 genes, cytochrome P450 genes, detoxification genes such as sulfotransferases, glutathione S-transferases, 
and cysteine proteases, and the like. 

Then, all the remaining molecules derived from all of the rat cDNA library clones were clustered 
based on the originator molecules described above. The clustering process involved identifying 
overlapping molecules that have a match quality indicated by a product score of 50 using BLAST. 
30 658 1 master clusters were identified. 

After forming the clone clusters, a consensus sequence was generated based on the assembly of 
the clone molecules using PHRAP (Phil Green, University of Washington). The assembled molecules 
were then annotated by first screening the assembled molecules against GenBank using BLASTn and 
then by screening the assembled molecules against GenPept using FASTX. About two thirds of the 
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assembled molecules were annotated, about one third of the assembled molecules were not annotated. 
For example, for nucleic acid sequence analysis, the program BLASTN 1 .4.9MP-WashU was used with 
default parameters; ctxfactor=2.00; E=10; MatID, 0; Matrix name, +5,-4. In another example, for amino 
acid sequence analysis, the program NCBI-BLASTX 2.0.4 was used with default parameters; matrix, 
5 BLOSUM62; gap penalties, existence 1 1, extension 1 ; frameshift window, decay constant 50, 0.1. 

VI Substrate and Array Element/Probe Preparation 

Clones nominated in the process described in Example V were used to generate array elements. 
Each array element was amplified from bacterial cells. PCR amplification used primers complementary 

10 to the vector sequences flanking the cDNA insert. Array elements were amplified in thirty cycles of PCR 
from an initial quantity of 1-2 ng to a final quantity greater than 5 ug. Amplified array elements were 
then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

Purified array elements were immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning, Corning NY) cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled 

15 water washes between and after treatments. Glass slides were etched in 4% hydrofluoric acid (VWR, 
West Chester PA), washed extensively in distilled water, and coated with 0.05% aminopropyl silane 
(Sigma-Aldrich, St. Louis MO) in 95% ethanol. Coated slides were cured in a 1 10°C oven. 

Array elements were applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522 and incorporated herein by reference. In brief, 1 ul of the array element DNA, at an 

20 average concentration of 0.5 Mg/ml in 3 x SSC, was loaded into the open capillary printing element by a 
high-speed robotic apparatus. The apparatus then deposited about 5 nl of the array element sample per 
slide. A total of 7404 array elements representing rat liver and kidney genes and a variety of control 
elements, including 14 synthetic control molecules, human genomic DNA, and yeast genomic DNA, were 
arrayed in four identical quadrants within a 1 .8 cm 2 area of the glass substrate. 

25 Microarrays were UV-crossl inked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays were washed at room temperature once in 0.2% SDS and three times in distilled water. Non- 
specific binding sites were blocked by incubation of microarrays in 0.2% casein in phosphate buffered 
saline (PBS; Tropix Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 0.2% SDS and 
distilled water as before. 

30 

VII Target Preparation 

Male Sprague-Dawley rats (6-8 wk old) were dosed intraperitoneally with one of the following: 
clofibrate (CLO; Acros, Geel, Belgium) at 250 mg/kg body weight (bw); acetaminophen (APAP; Acros) 
at 1000 mg/kg bw; benzo(a)pyrene (B(a)P; Acros) at 10 mg/kg bw; or dimethylsulfoxide vehicle (DMSO; 
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Acros) at less than 2 ml/kg bw, and the animals were later euthanized by C0 2 inhalation. Animals were 
monitored daily for physical condition and body weight. Three animals per group were sacrificed 
approximately 12 hours, 24 hours, 3d (d), 7d, 14d f and 28d following the single dose. Prior to sacrifice a 
blood sample from each animal was drawn and assayed for serum alanine transferase (ALT) and serum 
5 aspartate aminotransferase (AST) levels using a diagnostic kit (Sigma-Aldrich). Observed gross 

pathology and liver weights were recorded at time of necropsy. Liver, kidney, brain, spleen and pancreas 
from each rat were harvested, flash frozen in liquid nitrogen, and stored at -80°C. 

In the alternative, male Han-Wistar rats (8-9 wk old) were dosed by oral gavage with one of the 
following: fenofibrate (FEN; Sigma-Aldrich) at 250 mg/kg bw; carbon tetrachloride (CCL 4 ; Sigma- 

10 Aldrich) at 3160 mg/kg bw, hydrazine (HYDR; Sigma-Aldrich) at 120 mg/kg bw; a- 

naphthylisothiocyanate (ANIT; Sigma-Aldrich) at 200 mg/kg bw; 4-acetylaminofluorene (4-AFF; 
Lancaster Synthesis, Morecambe, Lancashire, UK) at 1000 mg/kg bw; corn oil vehicle, or sterile water 
vehicle, at 10 ml/kg bw. The animals were checked twice daily for clinical signs of distress. Blood was 
collected six days prior to the dose and at sacrifice. Three animals per group were sacrificed 

15 approximately six hours and 24 hours following the single dose. The animals were euthanized by 

exsanguination under isoflurane anaesthesia. Observed gross pathology and liver weights were recorded 
at time of necropsy. Livers from each rat were harvested, dissected into approximate 100 mg pieces, 
flash frozen in liquid nitrogen, and stored at -70°C. 

For each target preparation, frozen liver was homogenized and lysed in TR1ZOL reagent (Life 

20 Technologies, Gaithersburg MD) following the modifications for liver RNA isolation. Messenger RNA 
was isolated using an OLIGOTEX kit (QIAGEN) and labeled with either Cy3- or Cy5-labeled primers 
(Operon Technologies, Alameda CA) using the GEMBRIGHT labeling kit (Incyte Pharmaceuticals). 
Messenger RNA isolated from tissues of rats treated with clofibrate, acetaminophen, or benzo(a)pyrene 
was labeled with Cy5 and mRNA isolated from tissues of rats treated with DMSO was labeled with Cy3. 

25 Quantitative and differential expression pattern control cDNAs were added to each labeling reaction. 

Labeled cDNA was treated with 0.5 M sodium bicarbonate (pH 9.2) for 20 min at 85 °C to degrade the 
RNA and purified using two successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo 
Alto CA). Cy3-labeled control sample and Cy5-labeled experimental sample were combined and 
precipitated in glycogen, sodium acetate, and ethanol. 

30 Targets are also prepared from tissue needle biopsy samples. Samples are used to identify 

changes within the tissue following exposure to, for example, a toxic compound, a potential toxic 
compound, a compound with unknown metabolic responses, and a pharmacological compound. 
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VIII Hybridization 

Hybridizations were carried out using the methods described by Shalon (supra) . 

IX Detection 

5 The microscope used to detect the reporter-labeled hybridization complexes was equipped with 

an Innova 70 mixed gas 10 W laser (Coherent Lasers, Santa Clara CA) capable of generating spectral 
lines at 488 nm for excitation of Cy3, and 632 nm for excitation of Cy5. The excitation laser light was 
focused on the array using a 20x microscope objective (Nikon, Melville NY). The slide containing the 
array was placed on a computer-controlled X-Y stage on the microscope and raster-scanned past the 
10 objective. The 1.8 cm x 1.8 cm array used in the present example was scanned with a resolution of 20 
micrometers. 

In two separate scans, a mixed gas multiline laser excited the two fluorophores sequentially. 
Emitted light was split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics, San Jose CA) corresponding to the two fluorophores. Appropriate filters 

15 positioned between the array and the photomultiplier tubes were used to filter the signals. The emission 
maxima of the fluorophores used were 565 nm for Cy3 and 650 nm for Cy5. Each array was typically 
scanned twice, one scan per fluorophore using the appropriate filters at the laser source, although the 
apparatus was capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans was typically calibrated using the signal intensity generated by a 

20 cDNA control species added to the probe mix at a known concentration. A specific location on the array 
contained a complementary DNA sequence, allowing the intensity of the signal at that location to be 
correlated with a weight ratio of hybridizing species of 1 : 100,000. When two probes from different 
sources (e.g., representing test and control cells), each labeled with a different fluorophore, are 
hybridized to a single array for the purpose of identifying genes that are differentially expressed, the 

25 calibration was done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube was digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Norwood MA) installed in an IBM-compatible PC computer. 
The digitized data were displayed as an image where the signal intensity was mapped using a linear 20- 

30 color transformation to a pseudocolor scale ranging from blue (low signal) to red (high signal). The data 
was also analyzed quantitatively. Where two different fluorophores were excited and measured 
simultaneously, the data were first corrected for optical crosstalk (due to overlapping emission spectra) 
between the fluorophores using each fluorophore 's emission spectrum. 

A grid was superimposed over the fluorescence signal image such that the signal from each spot 
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was centered in each element of the grid. The fluorescence signal within each element was then 
integrated to obtain a numerical value corresponding to the average intensity of the signal. The software 
used for signal analysis was the GEMTOOLS gene expression analysis program (Incyte 
Pharmaceuticals). . 

5 

X Results 

The expression patterns of eight cytochrome P450 isozymes known to be induced in a 
toxicological response were monitored during the 28 day time course. The results using clofibrate, 
acetaminophen, and benzo(a)pyrene are shown in Table 1, Table 2, and Table 3, respectively. Each of 
10 the known genes was upregulated or downregulated greater than 2-fold at least once during the time 
course. 



TABLE 1 Gene expression patterns (x-fold change) of known genes in clofibrate-treated rat liver 



Gene 


12 hours 


24 hours 


3 days 


7 days 


28 days 


P450 LA-omega 4A3 


14.8 


26.6 


1.1 


0.5 


0.47 


P450 4A 


7.0 


16.6 


1.4 


0.5 


1.3 


P450 3A2 


0.14 


1.2 


0.63 


0.50 


0.45 



TABLE 2 Gene expression patterns (x-fold change) of known genes in acetaminophen-treated rat liver 



Gene 


1 2 hours 


24 hours 


3 days 


7 days 


14 days 


28 days 


P450 4A 


1.0 


4.5 


2.1 


2.0 


4.4 


4.8 


P450f 2C7 


0.21 


0.43 


0.47 


0.5 


1.2 


1.3 


P450 14DM 


0.31 


0.20 


2.0 


1.1 


1.4 


0.42 


TABLE 3 Gene expression patterns (x-fold change) of known genes in benzo(a)pyrene-treated rat liver 


Gene 


12 hours 


24 hours 


3 days 


7 days 


14 days 


28 days 


P450 LA-omega 4A3 


1.2 


2.3 


2.4 


1.4 


6.8 


1.2 


P450 MCA-inducible 1A2 


7.3 


9.2 


5.7 


2.5 


2.5 


0.5 



In addition, results from two samples that had been treated identically were compared to 
30 determine the range of normal variation of gene expression patterns between the samples. In one 

analysis, where two different samples were prepared from identically treated tissues, gene expression 
patterns of cDNAs which were upregulated or downregulated not more than 1 .7-fold were within the 95% 
confidence limits of a Poisson normal distribution. In a separate analysis, gene expression patterns of 
cDNAs which were upregulated or downregulated more than 2-fold accounted for not more than 5% of 
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the total hybridizable sample nucleic acid molecules in two identically-treated tissue samples. 

We have discovered novel nucleotide molecules that are up-regulated or down-regulated at least 
2-fold at least once during the time course. These molecules are SEQ ID NOs: 1-16 provided in the 
Sequence Listing. These polynucleotide molecules can be used for screening compounds or therapeutics 
5 for a toxicologic effect and applications including detecting metabolic and toxicological responses, and in 
monitoring drug mechanism of action. 

Table 4 shows the gene expression pattern of selected molecules that were upregulated at least 2- 
fold at least once during the time course following treatment with clofibrate (CLO). Table 5 shows the 
gene expression pattern of selected molecules that were downregulated at least 2-fold at least once during 
10 the time course following treatment with CLO. 



TABLE 4 Gene expression patterns (x-fold change) of CLO-upregulated nucleic acid molecules 



SEQ ID NO: 


12 hours 


24 hours 


3 days 


7 days 


28 days 


2 


2.6 


1.4 


0.5 


1.1 


1.2 


3 


1.3 


2 


1.3 


1.5 


1.5 


4 


2 


0.36 


0.47 


0.26 


0.30 


5 


1.7 


2.9 


1.6 


1.5 


1.2 


8 


2.6 


1.7 


1.3 


1.3 


1.4 



20 



TABLE 5 Gene expression patterns (x-fold change) of CLO-downregulated nucleic acid molecules 



SEQ ID NO: 


1 2 hours 


24 hours 


3 days 


7 days 


28 days 


1 


n.d. 


0.26 


0.45 


0.26 


1.1 


4 


2.0 


0.36 


0.47 


0.26 


0.30 


7 


0.24 


0.42 


0.37 


1.1 


1.5 



(n.d. = not detected) 



Table 6 shows the gene expression pattern of selected molecules that were upregulated at least 2 
fold at least once during the time course following treatment with acetaminophen (APAP). Table 7 
3 0 shows the gene expression pattern of selected molecules that were downregulated at least 2-fold at least 
once during the time course following treatment with APAP. 
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TABLE 6 Gene expression patterns (x-fold change) of APAP-upregulated nucleic acid molecules 



CPA 1V\ MM- 


i z nours 


z4 nours 


3 days 


7 days 


1 A Hauc 

i 4 * udys 


Hive 


9 


i i 

i .j 




t i 
i . i 


U.J 


1 7 




3 


1.2 


2.1 


0.47 


0.46 


1.8 


1.5 


4 


3.3 


0.47 


0.47 


0.23 


0.35 


0.36 


5 


1.1 


2.1 


1.1 


1.2 


1.3 


1.4 


6 


1.8 


5 


2.5 


1.1 


1.4 


1.3 


8 


1.1 


2.5 


1.1 


1 


1.7 


1.4 



10 TABLE 7 Gene expression patterns (x-fold change) of APAP-downregulated nucleic acid molecules 



SEQ ID NO: 


1 2 hours 


24 hours 


3 days 


7 days 


14 days 


28 days 


1 


0.36 


0.19 


0.46 


0.25 


0.5 


1.4 


4 


3.3 


0.48 


0.47 


0.23 


0.35 


0.36 


7 


0.33 


0.21 


1.7 


n.d. 


1 


0.39 



15 (n.d. = not detected) 

Table 8 shows the gene expression pattern of selected molecules that were upregulated at least 2- 
fold at least once during the time course following treatment with benzo(a)pyrene (B(a)P). Table 9 
shows the gene expression pattern of selected molecules that were downregulated at least 2-fold at least 
2 0 once during the time course following treatment with B(a)P. 



TABLE 8 Gene expression patterns (x-fold change) of B(a)P-upregulated nucleic acid molecules 



SEQ ID NO: 


1 2 hours 


1 day 


3 days 


7 days 


14 days 


28 days 


2 


0.5 


0.47 


1.2 


1.1 


2.6 


0.47 


3 


1.4 


2.1 


1.2 


1.5 


2.7 


1.6 


5 


1.5 


1.4 


1.2 


0.47 


2 


0.46 


6 


2.2 


1.4 


1.4 


1.2 


2.2 


n.d. 


7 


1.2 


2.2 


1.4 


0.5 


0.42 


1.1 


8 


1.6 


1.7 


1.3 


1.3 


2 


1.1 



(n.d. = not detected) 
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TABLE 9 Gene expression patterns (x-fold change) of B(a)P-downregulated nucleic acid molecules 



SEQ ID NO: 


12 hours 


1 day 


3 days 


7 days 


14 days 


28 


1 


0.37 


0.39 


0.35 


1.4 


0.33 


1.5 


4 


0.5 


0.26 


0.31 


0.36 


0.47 


n.d. 



5 (n.d. = not detected) 

Table 10 shows the library abundance of selected molecules that were up- or down-regulated at 
least once following treatment with various agents. Library abundance of each SEQ ID NO is presented 
as relative to that library which included the least abundant levels of nucleic acid molecule (SEQ ID NO) 
1 0 present. 



TABLE 10 Library abundance (least abundant = 1) patterns of nucleic acid molecules 



SEQ ID 
NO: 


Untreated 


CLO 


FEN 


APAP 


BaP 


CCI 4 


HYDR 


ANIT 


4-AAF 


8 


4 


7 


6 


3 


9 


4 


1 


1 


3 


9 


13 


5 


6 


4 


15 


5 


6 


6 


2 


10 


n.d. 


1 


8 


3 


n.d. 


n.d. 


n.d. 


1 


n.d. 


11 


5 


2 


4 


8 


20 


7 


10 


n.d. 


2 



(n.d. = not detected) 



20 

XI Identification and Analyses of Homologous Molecule in other Organisms 

The rat sequences (SEQ ID NOs:l-I6) were used to identify additional sequences in the 
ZOOSEQ and LIFESEQ databases (Incyte Pharmaceuticals) related to rat nucleic acid molecules 
regulated during toxicological response (SEQ ID NOs: 18-47). 

2 5 The first pass cDNAs, SEQ ID NOs:5, and 60 through 1 34, were assembled using PHRAP (Phil 

Green, supra) , using the following default parameters, to produce the contiguous sequence SEQ ID 
NO: 135. Mismatch penalty = -2; gap initiation penalty <0; gap extension penalty <0; minimum length of 
matching word = 14; minimum SWAT score = 30; bandwidth = 14; use raw SW scores, "No"; index 
word size = 10; maximum gap size = 30; number of initial bases to be converted to *N\ 0; vector segment 

3 0 length = 60; Mismatch penalty for scoring degenerate end sequence = -2; Min. score for converting 

degenerate end sequence to 'N\ 20; Minimum size of confirming segment = 8; Amount by which 
confirming segments are trimmed = 1 ; Penalty for confirming matches = -5; Min. SWAT score for 
confirming matches = 30; LLR cutoff for displaying discrepancies = 20; Minimum segment size = 8; 
Spacing between nodes = 4; Split/reassemble initial 'greedy' assembly, "No". 
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Translation of SEQ ID NO: 135 using MACDNASIS PRO software (version 1 .0, Hitachi 
Software Engineering) using default parameters of the program elucidated the putative protein coding 
region, SEQ ID NO: 136. The nucleic acid and amino acid sequences were queried against databases such 
as the LIFESEQ (Incyte), GenBank, and SwissProt databases using BLAST. Motifs, HMM algorithms, 
5 and alignments with BLOCKS, PRINTS, Prosite, and PFAM databases were used to perform functional 
analyses; the antigenic index (Jameson-Wolf analysis) was determined using LASERGENE software 
(version 1 .62dl, DNASTAR). BLAST2 analysis of SEQ ID NOs:135 and 136 using the human EST 
LIFESEQ database (Incyte) identified Incyte Clone Numbers 746355H1 (SEQ ID NO: 137) and 
1294663H1 (SEQ ID NO: 138) which were assembled with their respective clustered clones to produce 

10 SEQ ID NOs:37 and 38 which encoded SEQ ID NOs:5 1 and 52, respectively. 

Functional analysis of SEQ ID NO: 136 using BLOCKS, PRINTS, Prosite, PFAM, Motifs, and 
HMM algorithms identified a potential protein kinase C phosphorylation site at residue S84 (Motifs); a 
potential signal peptide from residue Ml through residue A33 (SPScan); a potential transmembrane 
domain from residue P37 through residue L56 (HMM TM), a sodium/neurotransmitter symporter 

15 signature from residue G34 through A53, a sodium/alanine symporter signature from G34 through A53, 
and an asparaginase/glutaminase family signature from residue W64 through residue G75 (BOCKS and 
PRINTS). 

Functional analysis of SEQ ID NO:51 using BLOCKS, PRINTS, Prosite, PFAM, Motifs, and 
HMM algorithms identified a potential protein kinase C phosphorylation site at residue S83 (Motifs); a 

20 potential signal peptide sequence from residue Ml through residue A52 (SPScan); a sodium/alanine 

symporter signature from residue G33 through residue A52, an asparaginase/glutaminase family signature 
from residue W63 through residue G74, and a channel-forming colicin domain from residue K31 through 
residue G49 (BLOCKS and PRINTS). Functional analysis of SEQ ID NO:52 using BLOCKS, PRINTS, 
Prosite, PFAM, Motifs, and HMM algorithms identified a potential signal peptide sequence from residue 

25 Ml through A53 (SPScan); a sodium/alanine symporter signature from residue G34 through residue A53, 
a 6-phosphogluconate dehydrogenase family signature from residue G 15 through residue A40, an FAD- 
dependant glycerol-3-phosphate dehydrogenase family signature from residue Y18 through residue Y30, 
and a vacuolar ATP synthetase 16 kDa subunit signature from residue L39 through residue G65 
(BLOCKS and PRINTS). 
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CLAIMS 

What is claimed is: 

1 . A method for detecting or diagnosing the effect of a toxic compound or molecule associated with 
increased or decreased levels of nucleic acid molecules in a mammalian subject comprising: 

a) treating a mammalian subject with a toxic compound or molecule; 

b) obtaining a sample containing nucleic acids from the mammalian subject treated with the 
toxic compound or molecule; 

c) contacting the sample with a microarray comprising a plurality of nucleic acid molecules 
of SEQ ID NOs:l-47, or a fragment thereof under conditions for the formation of one or 
more hybridization complexes; and 

d) detecting the hybridization complexes, wherein the presence, absence or change in amount of 
the hybridization complex, as compared with the hybridization complexes formed from nucleic 
acid molecules from an untreated mammalian subject, is indicative of a metabolic response to the 
toxic compound or molecule. 

2. The method of claim 1 wherein: 

a) the sample is a tissue chosen from liver, kidney, brain, spleen, pancreas, and lung; 

b) the sample is liver tissue; 

c) the toxic compound or molecule which elicits the metabolic response induces at least a 
2-fold change in the amount of at least one of the nucleic acid molecules of the sample; 

d) the toxic compound is a peroxisome proliferator; 

e) the toxic compound is a hypolipidemic drug; and 

f) the toxic compound is clofibrate or one of its corresponding metabolites. 

3. The method of claim 1 wherein: 

a) the sample is a tissue chosen from liver, kidney, brain, spleen, pancreas, and lung; 

b) the sample is liver tissue; 

c) the toxic compound or molecule which elicits the metabolic response induces at least a 
2-fold change in the amount of at least one of the nucleic acid molecules of the sample; 

d) the toxic compound is acetominophen or one of its corresponding metabolites. 

4. The method of claim J wherein: 

a) the sample is a tissue chosen from liver, kidney, brain, spleen, pancreas, and lung; 

b) the sample is liver tissue; 

c) the toxic compound or molecule which elicits the metabolic response induces at least a 
2-fold change in the amount of at least one of the nucleic acid molecules of the sample; 

d) the toxic compound is a polycyclic aromatic hydrocarbon; 
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e) the toxic compound is a diol epoxide; and 

f) the toxic compound is benzo(a)pyrene, or one of its corresponding metabolites. 

5. A method for detecting or diagnosing a toxicological response to a test compound or molecule in 
a mammalian subject, the method comprising: 

a) treating a mammalian subject with a test compound or molecule; 

b) obtaining a sample containing nucleic acids from the mammalian subject treated with the test 
compound or molecule; 

c) contacting the sample with a microarray comprising a plurality of nucleic acid molecules 
of SEQ ID NOs: 1-47, or a fragment thereof, under conditions for the formation of one or more 
hybridization complexes; 

d) detecting the hybridization complexes, wherein the presence, absence or change in amount of 
the hybridization complex, as compared with the hybridization complexes formed from nucleic 
acid molecules from a normal or untreated mammalian subject, is indicative of a toxic response 
to the test compound or molecule. 

6. The method of claim 5 wherein the test compound which elicits the metabolic response is a 
compound with a previously known metabolic response. 

7. The method of claim 5 wherein the test compound which elicits the metabolic response is a 
compound with a previously unknown metabolic response. 

8. An isolated and purified nucleic acid molecule selected from SEQ ID NOs: 1-11, 17-33, 36, 39, 
and 41, or a fragment thereof, wherein said fragments are at least 60 contiguous nucleotides in length. 

9. A method of using a molecule selected from SEQ ID NOs: 1-59 or a fragment thereof to screen a 
library of molecules or compounds to identify at least one molecule or compound which specifically 
binds the selected molecule, the method comprising: 

a) combining the selected molecule with a library of molecules or compounds under conditions 
to allow specific binding; and 

b) detecting specific binding, thereby identifying a molecule or compound which specifically 
binds the selected molecule. 

10. The method of claim 9 wherein the library is selected from DNA molecules, RNA molecules, 
peptide nucleic acids, artificial chromosome constructions, peptides, proteins, and dnigs. 

11. An isolated and purified antibody identified using the method of claim 9. 

12. An isolated and purified nucleic acid molecule variant having at least 70% nucleic acid sequence 
identity to the nucleic acid molecule of claim 8. 

13. An isolated and purified nucleic acid molecule having a sequence which is complementary to the 
nucleic acid molecule of claim 8. 
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14. An isolated and purified agonist identified using the method of claim 9. 

15. An isolated and purified antagonist identified using the method of claim 9. 

16. An expression vector comprising at least a fragment of the nucleic acid molecule of claim 8. 

17. A host cell comprising the expression vector of claim 16. 

18. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 17 under conditions suitable for the expression of the 
polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

19. An isolated and purified protein molecule encoded by the nucleic acid molecule selected from 
SEQ ID NOs:l-l 1, 17-33, 36, 39, and 41, an isolated and purified protein molecule of SEQ ID NOs:50 
and 53, or a portion thereof, wherein said portions encode at least 20 contiguous amino acids in length. 

20. A pharmaceutical composition comprising the protein molecule of claim 19 in conjunction with a 
suitable pharmaceutical carrier. 



36 



WO 01/36684 



PCT/USOO/31743 



SEQUENCE LISTING 



<110> INCYTE GENOMICS, INC. 
CUNNINGHAM , Mary Jane 
ZWEIGER, Gary 
KASER, Matthew R . 
PANZER , Scott 
SEILHAMMER, Jeffrey J. 
YUE, Henry 
BAUGHN , Mariah 
AZIMZAI, Yalda 
LAL, Preeti 

<120> MAMMALIAN TOXICOLOGICAL RESPONSE MARKERS 

<130> PC-0007 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 09/443,184 
<151> 1999-11-19 

<160> 138 

<170> PERL Program 

<210> 1 
<211> 285 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 
<223> Incyte ID No.: 700305024H2 

<400> 1 

agagttctag cctcacttta agatgcttct 
aagcgtantt ccaaagcatg ttacttacat 
aactctatca catggtttct actctctaat 
tcccaatcag aattttaaat cttgactttt 
gaccaaggca ggaaaaagaa aacaaggttc 

<210> 2 
<211> 291 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 
<223> Incyte ID No.: 700306220H1 



<400> 2 

ggctgcggtg ccttcggtcg cgcgtacacg ttgcatctcc tagcttcctc ctgaaccccg 60 

ttttacgttc gcggcgggga aaacagcctg acgagtagac tgcagctcct gggagatggc 12 0 

ggcgctgtgc cttacggtga acgccggaaa ccctccactg gaagctctgc tggcagtgga 180 

gcatgtgaaa ggtgatgtca gcatttctgt ggaagaaggg aaggagaatc ttcttcgggt 240 

ttctgagagt gtggtgttca ctgacacaaa ttcaatcctg cgctacttgg c 291 

<210> 3 
<211> 293 
<212> DNA 
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ttctctcaga attaaaggac tcgttttact 60 

tccttcttgc tatccacaga cctggtaatt 120 

ggagaacagg agaaaaatga gtcccaagct 180 

tttcccaaat catttaactg gagatgaaca 240 

tagagatcat ttgac 285 
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<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700510669H1 

<400> 3 

aggacctgtc cttacatatt gtggcctgaa 
acaattccac tgtttatttt ccttggtgct 
tttgaaattt tggcctactt tatttcagac 
tgatatattt gagagtggga aggaatctct 
aatggactag gcttaggatt acaatgaacc 



gggacaaaat atgaggagtt naatannagg 60 
aaattaaaga atcaagccct tgttcgagcc 120 
actcaaaata caaatgccaa caaatggtnc 180 
gatgtttaaa tttcactgtt gatctttcaa 240 
ttttgtcctt tgtcagtgtt teg 293 



<210> 4 
<211> 260 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700525676H1 



<400> 4 



geagctegga ctagtcagag gnctctggcg 
cccacagtga cggangcccg ggtgcgaggg 
tgtggtatac tgatgaacag aacccttgca 
tgtgcaagaa ccttcttcag aatcactaca 
gagaggagaa ttatagggta 



agggtggcat egggatgecg tccgaagtca 60 
tctgcgcgca aegtcaggta cttagctccc 120 
ccgtgggcct cagttttgee taaagagatc 180 
ccattagtaa ataagegana ggagtattca 240 

260 



<210> 5 
<211> 290 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700535332H1 



<400> 5 



aagggccagt tgcatccgca cccagtgctt gtaccttgaa ctcatttctt cctgactgct 60 

agaggectgt gtgttcttaa ctgctccgac ctctcctcca caggtgeagg cctggtgtgg 120 

tctccaaagt gactgaacaa tgcagaagga cagtggccca ctggttcctt tacattatna 180 

tggtttcggc tatgcggccc tggtggctac tggtgggatt attggctatg caaaagcagg 240 

tatgtgccgt ccctggctgc tggatcttct ttgggggcct ggcaggctgg 290 



<210> 6 
<211> 287 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 700536004H1 



<400> 6 



attatgtaaa taatgagcaa gatcaaatta 
tagtcagttt tgtcatgggg cagttcaagc 
ggttcttggc accactcacc caagttacat 
tcaccatgca tagtgggatt ttcagccatg 
ctctaaaata acctttgeta ateaatgeag 



acaaagacta gttacccagc attccgcatc 60 
tgccacctga gaacatcact aggctctcag 120 
ccaccagatt attttcagtc ttcacaagta 180 
aataaagggc gtgcgttttg ccatatcagt 240 
tgagttgcta aggttta 287 
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<210> 7 
<211> 264 
<212> DNA 

<213> Rat t us norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700640924H1 

<400> 7 

gtgatgaaat gaggtatctc aaatccactg acagataaga aaacagggtt agagggaaag 60 
tcacctctgt cacgtagagg cagaatatat gaacttaact ctagtttcca tgtctgtctt 120 
tattaccttc atctttctac ttcctggcca caggcatttc acttaattga gcctaatgtc 180 
agtatctgtg tgtgtcaatg tcgttaccac attctgatga agctaaaaaa taaaatttnn 240 
tttgggccaa aaaaaaaaaa aagg 264 



<210> 8 
<211> 238 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700775760H1 

<400> 8 

ganaccgaca ttttaatggt tcttangagg accaccacta gagtcaaggn ganaatggga 60 

tgacgcgtgt tgcngtcctg ctgattctga caagagctgn tcactatgac agacagatgg 120 

actgaatgga ctagaattat gtgaatctgt attatttaca gttggtancc aagagcatcg 180 

atactcttta gagaggcagg ttaaataaag gattaagtat ttaggatntg aaatttat 238 



<210> 9 
<211> 112 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700132084H1 

<400> 9 

ctatgcccaa ggaaaaggct ccagaacaca ttccccttct cttcattgcc ttcccatcaa 60 
gcaaggatcc aacctgggag gaccgattcc cagnnncggg ncannaagnn gg 112 



<210> 10 
<211> 238 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700176719H1 

<400> 10 

tcttggtccc ttcacctgac ctccggtgct ccaacgggcg gcagaatgga agaaggtgag 60 
gacccaggaa gtctgattaa agtgatccac ttgctggtct tgtctggtgt ctggggcatg 120 
cagatgtggg tgacctttgc ctcaggcttc ctgcttttcc ggagcctccc gaggcacacg 180 
tttggacttg tgcagagcaa gctcttccca gtctattttc acgtctcctt gggttgtg 238 
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<210>' 11 
<211> 247 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701195696H1 

<400> 11 

ggatctttct gggcgagcaa cccgcaaaac 
gttcccgatt ctactcccca cccgcgaccg 
gacgttgagg aggatgactg tctttctgaa 
caggacaaag tggcttttat cactggtggt 
attttca 



gttgtgcatt gcgttgaaaa ggtgcatctg 60 
cacacagcaa acatgaccca gcagccgcct 120 
taccaccacc tcttctgccc ggaccttctc 180 
ggttctggga ttggcttccg gatcgccgag 240 

247 



<210> 12 
<211> 256 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 
<223> 700483259H1 

<400> 12 

gtgacgtaca tggaaaacaa agcctacggg 
aaagcgcctn cggccctgaa gcatggcagc 
ccatgactac tatcggcgta agtagcccct 
tctgtggctg acccgcctcc ccttcccagg 
cggcggaagt gcagag 



gacaggctca agccgcagac agcagcaagt 60 
tatcccttcc agcggctcgc tcgtggctac 120 
cgccagcccc gcccagggct ggcccagggc 180 
acgtctgggc tcctcgtcca gcaacagctc 240 

256 



<210> 13 
<211> 285 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700607235H1 

<400> 13 

ctgaagaccc accatgtctc tgctgactac tgtactactt ctctggggtt tcattctggg 60 
cccagcaact gacacagcct gtatattcaa ggaagcctcg gaaaacagtc ccttgcccag 120 
gccctggctt tctgccaatc cagtgccctg gatcacacct ggcctgagga cattcctgct 180 
gtgccagggg acagtgcggg atgtagtctt catgctgagg cgggaaggag atgatggttt 240 
cctggcgata gtccaacaga tgtttttctg gagggagctg gaccc 285 



<210> 14 
<211> 293 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700609074H1 

<400> 14 

ggcgtggagt tggagnagag cgtcaggcgc 
cccaagaagg caggggctct tatgaggaag 
tctatcgtgt acgggctcaa gcagaaagat 



ctccgggaga agtttcatgg aaaagtgtcc 60 
tttggcagcg accacactgg agttgggcgc 120 
ggacaggagc tgagcaacga tttggacgnc 180 
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caggacccac cagaggacat gaagcaggac caagatatcc aggcagtagc cacctctctg 240 
ttgcccctga cgcaagccaa tcttcgaatg ttccaaagag cccaagatga cct 293 



<210> 15 
<211> 268 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700627890H1 

<400> 15 

gtacaangag ngccggggct tgggtctagt 
cagatgagag agttagccga gttaggggca 
gggctaacta taaagaagac tagatttcgn 
accgcccttg tcaaacaaca cagggccagc 
tgctggcatg ggggcacttg taccgtcc 



tggaggggan gcagtggcca gtncagggct 60 
gctactagga tgggggcagg aggagaagcg 120 
cacagtgggt atgtggaagg cagctttcaa 180 
agccttcaag accaggctat ccctgccgtc 240 

268 



<210> 16 
<211> 265 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700629293H1 

<400> 16 

atgaccttta acttttctaa aaatgtgaag 
tntagcattc tttagtgtac ttagtttgat 
tgatttttat gaatgttcat tttaagactc 
ataagcccga gaagaggatt cccttgggtg 
tctgaactgc aaccaaggcc tttnc 



ttttgtactt atatatatca gctaaagtat 60 
gccactttta gtgtttttgt tgcttttgtc 120 
cttgttgaaa tgggacagtt tcgttctttg 180 
ttgacctcct ctgcatgatg tgcccaagca 240 

265 



<210> 17 

<211> 267 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701322438H1 

<400> 17 

acctgccctt acatattgtg gcctgaagng 
attccccctt tcaggaaaga tgttctctta 
cctttgttca agcctttgca attttggcct 
atatttggga gtgggaaggn tctttgattt 
ctaggcctta ggattacaat gaacaac 



acaaaatatg agaagttcaa tgaaaagata 60 
ttttacttgg cgctaaatca aagaatcaag 120 
attttatttc agagagcaaa tggatggtat 180 
ttaaatttca ctgntgagct ttcaaataga 240 

267 



<210> 18 

<211> 239 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701082352H2 

<400> 18 
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atttcttagt ggggcaagga cctgccctta 
gaagttcaat gaaaagataa ttcccccttt 
gctaaatcaa agaatccagc cctttgttca 
agagagcaaa tggttgttat atatttggga 



catattgtgg cctgaaggga caaaatatga 60 
caggaaagat gttctcttat tttacttggn 120 
agcctttgca attttggcct attttatttc 180 
gtgggaagga atcttgattt ttaaatttc 239 



<210> 19 
<211> 244 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701423834H1 

<400> 19 

gtctcctgag tgcttaaatt acaggtgtgt 
aaaatgtcag tcactttaga tttcttagtg 
ctgaagggac aaaatatgag aagttcaang 
ttctcttatt ntacttggtg ctaaatcaaa 
tntg 



accactaaac caaccctaag aatccatttt 60 
gggcaaggac ctgcccttac atattgtggc 120 
aaaagataat tccccctttc aggaaagatg 180 
gaatcaagcc tttgttcaag cctttgcaat 240 

244 



<210> 20 
<211> 240 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701423842H1 

<400> 20 

gtctcctgag tgcttaaatt acaggtgtgt accactaaac caaccctaag aatccatttt 60 

aaaatgtcag tcactttana tttcttagng gggcaaggac ctgcccttac atattggggc 120 

ctgaagggac aaaatatgag aagttcaatg nanagntnan tccccctttc aggaaagatg 180 

gtctcttatt ttacttggng ctaaatcaaa gaatcaagcc tttgntcaag cctttgcaat 240 



<210> 21 
<211> 224 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701090430H1 

<400> 21 

ggcagctcgg accagtcaga gggccctggc 
gaccaccgtg acggaagccc cggcgcgggg 
ctgtggtgta ctgataaaca gaacccttcc 
ctgtgcgaga actttcttca gantctctgc 



gagggtggca tcggggtgcc atccgaagtc 60 
gtctgcgcgc gacgtcagac acttagctgc 120 
accgtgtgct gcagttttgc ctaaagagat 180 
gccactagta aata 224 



<210> 22 
<211> 249 
<212> DNA 

<213> Mus musculus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700966369H1 

<400> 22 
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gcttttatgt ancccaatca gagcancgac 
agcagaagan ctgcagantt ctgcttgatt 
cacagtagat aggaaaaagg tgtttgacag 
atcagatggc gaagggactt gctgccaggt 
gtgtgctgg 



cagnaaaatt gcaagtnttg agaggcacac 60 
ggcatctatc gttcctcctg agcagcaacc 120 
gcctggctaa gctcttgcgg agccactggc 180 
tgctgtctgt tggacagaag ctcngatgag 240 

249 



<210> 23 
<211> 260 
<212> DNA 
<213> Mus 



musculus 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700828522H1 



<400> 23 



caggcctggt gtggtctcca aagcgactga acaatgcaga aagacagtgg cccattgatg 60 

cctttacatt attttggttt cggctatgca gccctggttg ctaccggtgg gattattggc 120 

tatgccaaag caggtagtgt gccgtccctg gctgctggac tcttcttcgg gggcctggca 180 

ggcctggggg cctaccagct gtctcaggat cccaggaatg tgtgggtttt cctagctaca 240 

tctgggacct tggccggaat 260 



<210> 24 
<211> 246 
<212> DNA 

<213> Mus musculus 



<220> 

<221> mi sc_ feature 

<223> Incyte ID No.: 701250723H1 



<400> 24 



ctcggcttct cgctgtctgc tcgcgccctc 
tccggctcca gtgttggttg ggtgcaggcc 
cagaaagaca gtggcccatt gatgccttta 
gttgctaccg gtgggattat tggctatgcc 
ggactc 



gtcctacagc acaggcctcc cggctccggc 60 
tggtgtggtc tccaaagcga ctgaacaatg 120 
cattattttg gtttcggcta tgcagccctg 180 
aaagcaggta gtgtgccgtc cctggctgct 240 

246 



<210> 25 
<211> 252 
<212> DNA 
<213> Mus 



musculus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 701254093H1 



<400> 25 



acctcggctt ctcgctgtct gctcgcgccc 
gcttccggct ccagtgttgg ttgggatgcc 
cctggttgct accggtggga ttattggcta 
tgctggactc ttcttcgggg gcctggcagg 
aggaatgtgt gg 



tcgtcctaca gcacaggcct cccggctccg 60 
tttacattat tttggtttcg gctatgcagc 120 
tgccaaagca ggtagtgtgc cgtccctggc 180 
cctgggggct accagctgtc tcaggatccc 240 

252 



<210> 26 
<211> 237 
<212> DNA 

<213> Mus musculus 



<220> 

<221> misc_feature 
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<223> Incyte ID No.: 701423901H1 



<400> 26 



attttggttt cggctatgca gccctggttg 
caggtagtgt gccgccctgg ctgctggact 
taccagctgt ctcaggatcc caggaatgtg 
ccggaattat ggggatgaga ttctacaact 



ctaccggtgg gattattggc tatgccaaag 60 
cttcttcggg ggcctggcag gcctggggcc 120 
tgggttttcc tagctacatc tgggaccttg 180 
cggggaaatt tatnctgcag gntaatc .237 



<210> 27 

<211> 274 

<212> DNA 

<213> Mus musculus 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701251161H1 



<400> 27 



ggtgtttcgt gggttatctt 
aacgaagcct gcaaggaagg 
atcccttcca gcggctcgct 
gccagccccg cccagggctg 
cgcctgggct cctcgtccag 



tgcaaatggg ctccgcggcc 
ggttctccgc cgagcgcctc 
cgtggctacc catgactact 
gcccagggcc ctgtggctga 
cagcagctcc ggcg 



tagcgccctg gtggcctaaa 60 
ggtcctgaag catggcagcc 12 0 
atcggcgtaa gtagcccctc 180 
cccgcctccc cttcccagga 240 

274 



<210> 28 

<211> 141 

<212> DNA 

<213> Mus musculus 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701085115H2 



<400> 28 



aaagtgtccc ccaanaaggc aggggctctt atgaggaagt ttggcagcna ccacaccgga 60 
gttgggngct ctatcgtgta tggtgtcaag cagnaagacg gacangagct gatgcaacga 12 0 
cctggacgct caggacccac c 141 



<210> 29 

<211> 274 

<212> DNA 

<213> Mus musculus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 701387375H1 



<400> 29 



ggagggctcg ctcttggggc tagtggtggg 
gagaggtggc agaattagag gcagccacta 
gtataaagga nactagattt tgggacagtg 
ttaacaatcc acaaagaacc agnngctttc 
cttggacgtn caggangcac angtttcaca 



gaggcagtgg ccagttcagg gctcagatga 60 

ggatgggggt gcnaggagaa gcggggctaa 120 

gacgtgtgga aggcagcttc caaagcgcct 180 

aagaccaggc tatccctgct gnctgctgna 240 

ggcg 274 



<210> 30 

<211> 257 

<212> DNA 

<213> Mus musculus 
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<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701389479H1 

<400> 30 

agggctcgct cttggggcta gtggtgggga 
gangtggcag aattagaggc agccactagg 
ataaaggaga ctanattttg ggacagtgga 
aacaatccac anagaaccag cagctttcaa 
gacgtcagga ngnacaa 



ggcagtggcc agntcagggc tnagatgaga 60 
atgggggtgc gaggagaagc ggggctaagt 120 
cgtgtggaag gcagnttnca aagcgccttt 180 
gaccangcta tccctgctgc tgctgcactt 240 

257 



<210> 31 

<211> 246 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701389530H1 

<400> 31 

caaggagggc tcgctcttgg ggctagtggt ggngaggcag nggccagttc agggctcaga 60 

tganagaggc ggcanaatta gaggcagcca ctaggatggg ggtgccgagg agaagcgggg 120 

ctaagtataa aggagactag attttgggac agtggacgtg tggaaggcag cttccaaagc 180 

gcctttaaca atccacaaag aaccagcagc tttcaagacc angctatccc tgctgctgct 240 

gcactt 246 



<210> 32 

<211> 258 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 701388372H1 

<400> 32 

gagggctcgc tcttggnggc taagnggtgg 
gagagaggtg gcagaattag aggcagccac 
agtataaagg agactagatt ttgggacagt 
tttaacaatc cacanagaac cagnagcttt 
acttgacgtc cagaaggc 



ggagtcagtg gccacgttca gggctcanat 60 
taggatgggg gngccaggag aagcnggcta 120 
ggacgtgngg aaggcagctt ccaaagcgcc 180 
caaagaccag gctatccctc tgctgctggc 240 

258 



<210> 33 

<211> 257 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701270715H1 

<400> 33 

gttttctcat gaattgtttt tgcattgttg ataaagctag tatacccttt ggccttagcc 60 

tataaatttt aaatatataa acaaaatatt aaagatgtag ttaattttaa atgaccttta 120 

acttttctaa aaatgtgaag ttttgtactt acatatcatc taaagtatta tagcattttt 180 

aagtgtactt agtttgatgc cacttttagt gttttgttgc ttttgtctga tttttgtgaa 240 

tgttcatnta agactcc 257 



<210> 34 
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<211> 4850 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 2302721 

<400> 34 



cgcacacgtt gcatcttctt cctttcgcgg 
gtactgacag gtggaccagc ggactggtgg 
caggagaccc tccgctagga gctttgctgg 
tttccgttga agaagggaaa gagaatattc 
atgtgaattc tatacttcgc tacttggcta 
ctaatctgat ggaacatact gagattgatc 
cttcatgtga ttcctttact tctacaatta 
catacttagt tggaaactcc ttgagtttag 
gaaatgctgc ctggcaagaa cagttgaaac 
ggtttggctt tcttgaagcc cagcaggcct 
caacaaccaa agctcgagtg gcacctgaga 
ttccaggtgc ggagatggga aaggttaccg 
tacacattgg gcatgcaaaa gctgctcttc 
ggaaactgat catgagattt gatgacacaa 
aggttatctt ggaagatgtt gcaatgttgc 
cggatcattt tgaaactata atgaagtatg 
atgtggatga tactcctgct gaacagatga 
aacatagaaa aaaccctatt gagaagaatc 
gccagtttgg tcagtcctgt tgtttgcgag 
gcatgagaga tccaaccctt tatcgctgca 
aatacaatgt ttatccaaca tatgattttg 
ttacacatgc cctgagaaca acagaatacc 
ttgaagcttt aggcataaga aaaccatata 
acacagtgct atccaaaaga aaactcacat 
gggatgaccc aagatttcct acggttcgtg 
gactgaaaca gtttattgct gctcagggct 
acaaaatctg ggcgtttaac aaaaaggtta 
tactgaagaa agaagtgatc ccagtgaatg 
tagccaaaca cccaaagaat cctgaggttg 
ttttcattga aggtgctgat gcagagactt 
attggggcaa cctcaacatt acaaaaatac 
ttgatgcaaa gttgaatttg gaaaacaaag 
ttgcagagac tacacatgct cttcctattc 
tcacaaagcc agtgctagga aaagacgagg 
agcatgaaga gctaatgcta ggggatccct 
tacaactcca gagaagagga ttcttcatat 
atagttgcaa ggaagccccg tgtgttttga 
tgccaacatc agggtcaaag gaaaagacca 
ctccttttaa ggaaagacca acaccttctc 
ccttggtcct ttacaataga gtggctgttc 
agaaagcacc aaaggaagat gtagatgcag 
aatataagga gaaaactggc caggaatata 
agaatatttc ttctaattcc tcagcaagta 
ttgctgcaca aggggaggtg gttcgtaagc 
taaatgaagc tgtagaatgc ttactgtccc 
aggagtacat acctggtcag cccccattat 
attctgaacc tgctggttta gaaacaccag 
ctcaagggga agtagttcgg aaacttaaaa 
tagctgttca agaactcctt cagctaaagg 
ataagcctgt gtcggccact ggagctgagg 
ataaatctga aaagcagaat aagcctcaga 
ctaaaaacca aggaggtggg ctctcatcaa 
aacagaccag gttgggtctt gaggcaaaaa 
aggtcatcac aaagtcagaa atgattgaat 
gtccctgggc ctatgccatt tgggaagcca 
aacttggtgt tgaaaactgc tacttcccca 
agaagactca tgttgctgac tttgccccag 
ccgagctggc agaaccaatt gccattcgtc 



ggtcctccgt agttctggca cgagccaggc 60 
agatggcgac gctctctctg accgtgaatt 120 
cagtagaaca cgtgaaagac gatgtcagca 180 
ttcatgtttc tgaaaatgtg atattcacag 240 
gagttgcaac tacagctggg ttatatggct 3 00 
actggttgga gttcagtgct acaaaattat 3 60 
atgaactcaa tcattgcctg tctctgagaa 420 
cagatttatg tgtttgggcc accctaaaag 480 
agaagaaagc tccagttcat gtaaaacgtt 540 
tccagtcagt aggtaccaag tgggatgttt 600 
aaaagcaaga tgttgggaaa tttgttgagc 660 
tcagatttcc tccagaggcc agtggttact 720 
tgaaccagca ctaccaggtt aactttaaag 780 
atcctgaaaa agaaaaggaa gattttgaga 840 
atatcaaacc agatcaattt acttatactt 9 00 
cagagaagct aattcaagaa gggaaggctt 960 
aagcagaacg tgagcagagg atagaatcta 1020 
tacaaatgtg ggaagaaatg aaaaaaggga 1080 
caaaaattga catgagtagt aacaatggat 1140 
aaattcaacc acatccaaga actggaaata 1200 
cctgccccat agttgacagc atcgaaggtg 1260 
atgacagaga tgagcagttt tactggatta 1320 
tttgggaata tagtcggcta aatctcaaca 1380 
ggtttgtcaa tgaaggacta gtagatggat 1440 
gtgtactgag aagagggatg acagttgaag 1500 
cctcacgttc agtcgtgaac atggagtggg 1560 
ttgacccagt ggctccacga tatgttgcat 1620 
tacctgaagc tcaggaggag atgaaagaag 1680 
gcttgaagcc tgtgtggtat agtcccaaag 1740 
tttcggaggg tgagatggtt acatttataa 1800 
acaaaaatgc agatggaaaa atcatatctc 1860 
actacaagaa aaccactaag gtcacttggc 1920 
cagtaatctg tgtcacttat gagcacttga 1980 
actttaagca gtatgtcaac aagaacagta 2040 
gccttaagga tttgaaaaaa ggagatatta 2100 
gtgatcaacc ttatgaacct gttagcccat 2160 
tatacattcc tgatgggcac acaaaggaaa 2220 
aagtagaagc cacaaaaaat gagacctctg 2280 
tgaataataa ttgtactaca tctgaggatt 2340 
aaggagatgt ggttcgtgaa ttaaaagcca 2400 
ctgtaaaaca gcttttgtct ttgaaagctg 2460 
aacctggaaa ccctcctgct gaaataggac 2520 
ttctggaaag taaatctctg tatgatgaag 2580 
taaaagctga aaaatcccct aaggctaaaa 2640 
tgaaggctca gtataaagaa aaaactggga 2700 
ctcaaagttc ggattcaagc ccaaccagaa 2760 
aagcgaaagt actttttgac aaagtagctt 2820 
ctgaaaaagc ccctaaggat caagtagata 2880 
cacagtacaa gtctttgata ggagtagagt 2940 
acaaagataa gaagaagaaa gaaaaagaaa 3 000 
aacaaaatga tggccaaagg aaagaccctt 3 060 
gtggagcagg agaagggcag gggcctaaga 3120 
aagaagaaaa tcttgctgat tggtattctc 3180 
accatgacat aagtggctgt tatattcttc 3240 
tcaaggactt ttttgatgct gagatcaaga 3300 
tgtttgtgtc tcaaagtgca ttagagaaag 3360 
aggttgcttg ggttacaaga tctggcaaaa 3420 
ctactagtga aacagtaatg tatcctgcat 3480 
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atgcaaaatg 
tggtgcgttg 
aggaagggca 
ttgacttata 
agacggaaaa 
ctgctagtgg 
aaatgtttga 
aaaactcctg 
acatgggttt 
gcattaccaa 
atcgaaggcg 
ctccaggttg 
ggccacgtga 
tgacagttgc 
tcaccctttt 
tggaagactt 
aaattgactg 
gtgctccatc 
agcctggagc 
gcagctactg 
gatattagta 
aaatgaagtc 
tctgtaaaca 



ggtacagtca 
ggaattcaag 
cagtgctttt 
tgctcaggta 
ggaaaaattt 
aagagctatc 
aatcgttttt 
gggtctgaca 
agtattacca 
tgcactttct 
attactcagt 
gaaattcaat 
tatgaagagc 
tgaaaatgag 
cacaagggct 
tcagaagata 
tgaggactgg 
catgggagct 
caaatgtgtc 
agggatgaac 
tcttctcaga 
acacaggaca 
actccagtaa 



cacagagacc 
catcctcagc 
gctaccatgg 
tatgaagaac 
gcaggaggag 
cagggaggaa 
gaagatccaa 
actcgaacta 
ccccgtgtag 
gaagaagaca 
gttaacatcc 
cactgggagc 
tgtcagtttg 
gcagagacta 
tctgaagacc 
ctagattctg 
atcaaaaaga 
aaaagccttt 
tgtggcaaga 
gaaagccccc 
tacagaccat 
attattctta 
taaatatcat 



tgcccatcaa 
ctttcctacg 
aagaggcagc 
tcctggcaat 
actatacaac 
catcacatca 
agataccagg 
ttggtgttat 
catgtgttca 
aagaagcgct 
gcgttagagc 
tcaagggagt 
tagccgtcag 
aacttcaagc 
ttaagactca 
gaaagattgt 
ccactgccag 
gcatcccctt 
accctgccaa 
tcttcaactc 
tttatgattt 
tgcctaagtt 
gaactaaaaa 



gctcaatcag 
tactcgtgaa 
ggaagaggtc 
tcctgttgtt 
tacaatagaa 
tttagggcag 
agagaagcaa 
gaccatggtt 
ggtggtgatt 
gattgcaaaa 
tgatttacga 
tcccattaga 
acgagatact 
tattttggaa 
tatggttgtg 
tcagattcca 
ggatcaagat 
caaaccactc 
gtactacacc 
ctctcacttt 
tttaaaaagt 
aacagtggat 
aaaaaaaaaa 



tggtgcaatg 
tttctttggc 
ttgcagatac 
aaaggaagaa 
gcatttatat 
aatttttcca 
tttgcctatc 
catggggaca 
attccttgtg 
tgcaatgatt 
gataattatt 
cttgaagttg 
ggagaaaagc 
gacatccagg 
gctaatacaa 
ttctgtgggg 
cttgaacctg 
tgtgaactgc 
ttatttggtc 
ttaaagcatt 
aaaagttcta 
aaaagacttt 



3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4850 



<210> 35 
<211> 1762 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No. 



2742442 



<400> 35 

attgcgcgag 
ggcccgcagc 
cgacgtgcaa 
ggctggaaga 
tggtccggga 
cggccttgag 
aggcgcccgt 
ttccactaca 
ctgcaccatt 
tgcaggaaat 
gcaaaaaatc 
gatttccacc 
taaaggcatc 
gcccaggtct 
ttcgatcact 
tggtagctgc 
gggagttaat 
gcttctgact 
agaaagcatt 
agaatataga 
acattatcat 
tgattcttct 
gaaagaggtt 
ttattaaatt 
tttgagtaat 
aatcccagca 
agcctggtca 
gtggcacaca 
cgggaggcgg 
agggagactc 



cgcacgggaa 
tccgattggc 
ccgggaaggg 
gaccggagcc 
ggtgacgacc 
aagggtagtc 
gcggaatggc 
tacctcaatt 
aataaacaaa 
gtatgatgta 
agatgttata 
tgtgttggag 
ttgtactgat 
tcctggctac 
tctacattcc 
ctttgaaaga 
gcttcatgaa 
ttagtttgtt 
tgttaaacgc 
ctcacttgta 
gttctgcatt 
gagcagaagt 
agttaagata 
cagactattt 
catatattta 
cttggggaag 
acatggcgaa 
cctgtaatcc 
aggttgcagt 
tgtctccaaa 



aagcgattgg 
cgacaggctg 
aagaaggggc 
gggctccggg 
ggcttcggag 
tcgggatgcc 
agatatttag 
ttgcctaagg 
aggaaagaat 
gtatcgggag 
tcaaagagat 
cgatatacat 
gggagacttt 
ccaagaactt 
cagcttgcca 
agagcatgta 
gtccatcaca 
cacttttagg 
agctttggtt 
catagaatta 
acaataatgg 
tgaaactgta 
ttcacacagt 
gtaacttatt 
aagtaaaaac 
ctgaggtggg 
accccatctc 
cagttacttg 
tagccaagat 



tcggtcagga 
acgggaacgt 
gtgtcaggct 
cccgaccaga 
agtctatcat 
gtccgaagtc 
cttcctgtgg 
agatatgtgc 
attcagagag 
tggaggatta 
ctggatattg 
cagtagtaac 
tcaatcattt 
gtaccttgga 
cactcttttt 
agctgtatgg 
cataaaggca 
aagtattttc 
ataaacctgc 
tttcttcaag 
gatgtcatca 
aatttaaacc 
atgtattata 
gttatagggc 
tttgggctgg 
cagatcagtt 
tactaaaaat 

ggaggctgag 

cgccctgctg 



gagagaggtg 
ttacggtcag 
gcgcaggcgg 
ggagggcggt 
ggcagctcgg 
ggcgacagcg 
tatactgatg 
acgaactttc 
aagaatttta 
caagcatttt 
taaaacaaga 
cttggtgaaa 
ggagactatt 
tttttcaatt 
tgatgaagtt 
tccagaaaca 
aaaaagaact 
atgacatgtt 
accattgaaa 
tataattcaa 
ccattgctag 
ttttaattat 
ttaaccatat 
ctgccgtatg 
gcacagtggc 
gaggtcagga 
acaaaaatta 
gcacaagaat 
cactccagcc 



tgtcctggcg 
cgtgtgtcag 
ccagtccatt 
gctgcagggc 
actggtcata 
gccggggcgc 
agcagaactc 
ttcaaaatca 
ggatattcaa 
gttccttggt 
ttagaaattg 
cctcatttag 
tggtgtttta 
tcttttgaat 
gtgaagcaga 
aatatacctc 
ggtgccacct 
ttcagaagcc 
atttgcacat 
aataatatgg 
aatactggca 
caccttacct 
cacacttaag 
gcttaggata 
tcacacctgt 
gttctagacc 
gctgggcgtg 
cgcttgaacc 
tgggcaacag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1762 
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<210> 36 
<211> 2862 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 3511087 

<400> 36 



ctaagctcag 
acccatgaca 
agtcagagga 
gttaggaacc 
tccacctttg 
tgaatccaaa 
taattcttcc 
gggaatttca 
accatcttca 
ttgccagatc 
catagatcat 
atcatggaaa 
atgtaaagat 
ttgccaagag 
tttctttggc 
tggggaattt 
aagaaataaa 
caataagtgc 
ttttcatggt 
ggaagatgag 
tgaaacaggt 
ggaagcaaat 
tatgaagata 
caaaaacaga 
gatgagagtg 
gaaacaaatg 
atatgttgga 
gaaggagaat 
aaatgaaaaa 
catgccaaca 
ctgcaacagt 
tttccacacc 
tatttgtgat 
tggaaatgcc 
caaagcacga 
tttgtttaaa 
gaccagaaaa 
gtattggcgt 
ttcacaggcc 
cccggttgcc 
tatatctagg 
aaacacttgt 
atgaatttgt 
ttttcttaac 
tatttttctg 
aatattttga 
gattttgaaa 
tgtaataaag 



aattcggctc 
gcttcatgaa 
ccccttcaga 
ttacccattg 
acttcatcct 
cgagatctgt 
cttttactta 
agtcagccta 
tcagtgactc 
tgttttgtaa 
aagtgtaaga 
aaaatacgtc 
gttgctgctg 
gagatagatg 
ggcaatggaa 
atattccttt 
gataattcta 
cttgtccaca 
cagtgtcagc 
tgcttttatg 
atctcacatg 
gtacctggag 
aagtttgtgt 
aaatattgta 
atgtctattg 
cctttacagt 
aactgttcct 
gggatacaag 
agtgaagaca 
gattatgctg 
gagaagcagt 
gaggacgacc 
aggtatatga 
gaacttcatg 
aaagatcact 
gatttaaact 
attgaaagtg 
atatcgttcc 
tggccacgct 
atctgttgaa 
acaactcttt 
actgggatca 
ttaatataat 
tcagatttta 
tatttgtatg 
aaaataattg 
gattaaacct 
attttatata 



gagtgctttt 
ttttaatcac 
cagccagtgt 
gtgcgaggta 
tagaagattt 
ccacctcaac 
tgaatggacc 
gaaatgactt 
caagacatcc 
aatcaggccc 
aagatatttt 
caagaccaac 
aggaggaatg 
tgtggacact 
agattaacct 
gtgagaaatg 
ctgcttgttc 
ttttgcgaga 
ttgatttatg 
cccatagtct 
atgctattgc 
cgcaggtact 
gcgcccagtg 
gtgcaaaagc 
aacgtaagaa 
ttgatctgtg 
ttgctcatag 
atatggagca 
tagccagtca 
aagttacagt 
ggcagggcca 
agtactgctg 
atggcacctg 
aatgggaaga 
taattggccc 
aatatgctgg 
ttctgaggca 
tec t gage ag 
ctcacggaac 
cagacttttg 
gagttggtcc 
gaatacatga 
aaagtttgct 
agecaaataa 
tttcactata 
actcaaattt 
gtactgttgg 
tagatgeaaa 



atttgctggt 
atcttttttt 
ctctcctagc 
tgctcctcca 
ttgttcttct 
ttctagagag 
aggtagtttg 
tggaaacttt 
cctcgaagga 
taagttaatg 
aateggtagg 
aaaaacaaat 
tagatattca 
ggagcggaaa 
tactgtgttc 
ttttgatcat 
tcacccggtt 
gacaacagta 
tcgacatgaa 
tgtggaactg 
tcaagagtct 
tggtaatcaa 
tctgagaaac 
aaggcattcg 
gtggatgaac 
caaccatatt 
tcctgaggaa 
attttacgaa 
gtcaaacaag 
ggactttcac 
catctcctcc 
gcagcaccgc 
cccagaagga 
aagaagagat 
aaatgataat 
cttttatgta 
catagcagag 
caacccacag 
cactggcatc 
gatgaagtgt 
ttcatataag 
tggatgaaat 
acttatctgt 
ccatttaaca 
ataagggaat 
tatttcttgg 
tattgtgtta 
aaaaaaaaaa 



gttgaaagta 
cttccgcagc 
atgccctttt 
ccctccttct 
ttaaattcat 
ggaacaccgc 
tttgcttcag 
tttggaagtg 
acccatgaat 
gatttcactt 
ataaagaatg 
tatgaaggac 
ggccactgca 
ggagcattca 
aaacttctcc 
aagcctagaa 
acaaagcatg 
aaatactcca 
gttcggtatg 
aaagtctgga 
aaacgatatt 
ataatgectg 
ggtcaagtca 
tggaccaaag 
atccgtcctc 
gcttctggga 
agagaagttt 
ctatggctca 
gaaaatggaa 
tgctggatgt 
gagaagcaca 
ttcccaacag 
aacagctgta 
gecctaaaga 
gactttggaa 
tgatacctaa 
gagctgeaga 
taggtaggaa 
agatggtgaa 
gttggggaag 
aatcgtgacg 
tctttacatg 
atgtaggttg 
ctagtatttg 
taaggataat 
tettttgetg 
gtgtatggac 
aa 



gttcagccaa 
cgtcagcttt 
cggcatcgct 
cagaatttta 
tttcaatgag 
ttaacaacag 
agaatttcct 
cagttaccaa 
tgagacaagc 
accatgetaa 
ttgaagataa 
catattatat 
cgtttgctta 
geegggagge 
aggagcatct 
tgataagtaa 
agtttgaaga 
aaatacgttc 
gctgtttaag 
taatgcaaaa 
ggcagaattt 
gatttcttaa 
ttgaaccaga 
accggcgtgc 
tccccacaaa 
aaaaatgtca 
ggacttacat 
agagtcaaaa 
aacaaattca 
gtgggaaaaa 
aagagaaggt 
gctatttcag 
aatttgeaca 
tgaagctcaa 
aatatagttt 
tcagagcatt 
tttcctgett 
aatgggctgt 
gtgactgeta 
aggataaggt 
gtaagagaat 
ttttagcaga 
ctaaaaagga 
ttaaatgggg 
gtgeattgag 
tttaaatgat 
caatactgcc 



60 
120 
180 • 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
.2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2862 



<210> 37 
<211> 1263 
<212> DNA 

<213> Homo sapiens 



12/52 



WO 01/36684 



PCT/USOO/31743 



<220> 

<221> misc_ feature 
<223> Incyte ID No. 

<400> 37 



1968009 



ccgtccccat 
cccctctcct 
ctcgcagtca 
cttgttttct 
gcaggacact 
tgcttctggt 
gctgctcttt 
cgtttgggtt 
ccactctgga 
caaagttgga 
ctgatgaaga 
gcagcatttt 
tggaaaatca 
ataagaccct 
ccactatttt 
taaaaaaaaa 
cagaggtggc 
tatattgatg 
gtgtcagctt 
gtcttttagg 
gaaattatgt 
aaa 



tctctgaccg 
cgccccgcca 
ttcctgcggc 
gcaggtgcag 
ggctcagtag 
gggatcattg 
ggcagtctag 
ttcctagcta 
aaattcatgc 
gttagtatgt 
attaaaaatc 
tgcatctgac 
gtcatgatta 
catagagctt 
caatatatta 
gacaccaaac 
gagtatgtaa 
ttgtcttttc 
tcagggctct 
agatttacaa 
taagtgaaat 



cccctctccc 
ccgggacgga 
ttgcgcgccc 
gcctggggta 
tgcctttgca 
gctatgtaaa 
ccggcctggg 
catctggtac 
ctgcaggttt 
tcaacagacc 
tgcatcttcc 
attttaccta 
caaacctaca 
gattcttgta 
agagaaataa 
ttggcagaga 
cacaagagct 
tttctgtatc 
gaaaccccat 
tatctgttct 
atcaatgaaa 



ggtacactgc 
gagcgcccgc 
ttgtagacag 
gtctcctgtc 
ttggtttggc 
agcaggcagc 
tgcttaccag 
cttggctggc 
aattgcaggt 
ccattagcag 
actattttca 
aaaaaaaaga 
gaggtggcga 
tattgatgaa 
gtgcagcatt 
ggtggaaaat 
taataagacc 
tgtaggtaaa 
tccctgctct 
tttgctcatc 
ataaagttta 



gcaggcacaa 
cgctgcattt 
ccggggcctt 
tggacagaga 
tttggctacg 
gtgccgtccc 
ctgtctcagg 
attatgggaa 
gccagtttgc 
aagtcatgtt 
atatattaag 
caccaaactt 
gtatgtaaca 
gaattaaaaa 
tttgcatctg 
cagtcatgat 
ctcatagagc 
tctcaagggt 
gaggaacagt 
ttagaccaca 
ctataaataa 



cagagccgct 
ccggcgacac 
cgtgagaccg 
agagaaaaat 
cagcactggt 
tggctgcagg 
atccaaggaa 
tgaggttcta 
tgatggtcgc 
ccagcttaga 
agaaataagt 
ggcagagagg 
caagagctta 
tctgcatctt 
acattttacc 
tacaaaccta 
ttgattcttg 
aaaatgttag 
gtgaaaaaaa 
gactgacttt 
taaaaaaaaa 



60 
120 . 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1263 



<210> 38 
<211> 978 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 1923127 

<400> 38 

ctcgagccgc gcggccccgg ggcgcacgcg cacgcaatcg cgtttccgga gagacctggc 60 
tgctgtgtcc cgcggcttgc gctccgtagt ggactccgcg ggccttcggc agatgcaggc 120 
ctggggtagt ctcctttctg gactgagaag agaagaatgg agaagcccct cttcccatta 180 
gtgcctttgc attggtttgg ctttggctac acagcactgg ttgtttctgg tgggatcgtt 240 
ggctatgtaa aaacaggcag cgtgccgtcc ctggctgcag ggctgctctt cggcagtcta 300 
gccggcctgg gtgcttacca gctgtatcag gatccaagga acgtttgggg tttcctagcc 360 
gctacatctg ttacttttgt tggtgttatg ggaatgagat cctactacta tggaaaattc 420 
atgcctgtag gtttaattgc aggtgccagt ttgctgatgg ccgccaaagt tggagttcgt 480 
atgttgatga catctgatta gcagaagtca tgttccagct tggactcatg aaggattaaa 540 
aatctgcatc ttccactatt ttcaatgtat taagagaaat aagtgcagca tttttgcatc 600 
tgacatttta cctaaaaaaa aaaagacacc aaatttggcg gaggggtgga aaatcagttg 660 
ttaccattat aaccctacag aggtggtgag catgtaacat gagcttattg agaccatcat 720 
agagatcgat tcttgtatat tgattttatc tctttctgta tctataggta aatctcaagg 780 
gtaaaatgtt aggtgttgac attgagaacc ctgaaacccc attccctgct cagaggaaca 840 
gtgtgaaaaa aaatctcttg agagatttag aatatctttt cttttgctca tcttagacca 900 
cagactgact ttgaaattat gttaagtgaa atatcaatga aaataaagtt tactataaat 960 
aataaaaaaa aaaaaaaa 978 



<210> 39 

<211> 851 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 3123954 



13/52 



WO 01/36684 



PCTYUS00/31743 



<400> 39 

cggcacgcgt 

tcccgcggtc 

gatcgtgcgc 

gcatggcggc 

gcctgggttc 

ttccccaccc 

tc gggaagtc 

ctccccaggc 
agcccggcgg 
caccagcccc 
aatccgcatc 
gcagcaagca 
tctaccgccc 
ccaagcggaa 
aaaaaaaaag 



ggggtccgcg 
ttctctgcaa 
cgaggcccgc 
catcccctcc 
cacttccagc 
cccaggtctc 
caccctcccg 
ctccagcagc 
tcagtccagc 
aggcctgcgg 
cctcgccccc 
gaccttcgca 
ggccccagca 
aataaactcc 

g 



cgtgcgcacc 
atgggctccg 

gaggggtcgc 

agcggctcgc 
aacagctcct 
cccaaggctg 
ttcatggcca 
atgaccgcct 
acagccagcg 
aggcgctagt 
ctccccacct 
tcaacacagc 
ctcgctagct 
aagcagccag 



ccgcgcgcgc 
tggcctagcg 
cgcccagatc 
tcgtggccac 
gcagcagtac 
acccgggtca 
cggtgttgga 
gtggcctggc 
ctgggccccc 
ccaccagagc 
cccacccccc 
agacaccaaa 
ttcctgacac 
taaaaaaaaa 



ctctctgtcg 
cccccgtccc 
ccaccagcca 
ccacgactac 
cgagtgcccc 
ttggtgggcc 
gtccgcagag 
tcgggacgcc 
gtcctgacct 
ccctccccgc 
accctgtaaa 
aaccagtgag 
ctggaactgt 
aaaaaaaaaa 



tggcgcggct 60 
cgccacccgt 120 
gcaagctaaa 180 
taccggcgcc 240 
ggggaagcca 3 00 
agcttctttt 360 
cactcggaac 420 
ccgaggaagc 480 
gagcggttac 540 
ccctntcccn 600 
ctaggcggct 660 
agccccgctc 72 0 
gcacctggca 780 
aaaaaaaaaa 840 
851 



<210> 40 

<211> 1907 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No. 



1321844 



<400> 40 



tggaccgacg 
gggaactgcg 
ggcctccggg 
aaattcggca 
gatggccaag 
gaccgggaca 
atgtttcaac 
tcctgcgacc 
aaatgccgga 
cactgcccga 
tgctacgggt 
gacccggatc 
cgagagcccc 
ctgcccaaag 
tgcttgagcc 
gaggaggagg 
ctgccaggtg 
tgagctcaaa 

gggggattcc 

gtgggacaca 
tgaccaatgt 
gaatgagcat 
caccatcctg 
agtgggcaca 
ggagagagag 
caaggctttt 
ttccatctcc 
ccttaattcc 
caaactagga 
ataagagtac 
tgcctcatca 
tttgttctct 



ggcgcaccca 
ccgccgccac 
agaagtttca 
gcgaccacac 
aactaagtaa 
ttcaggcagt 
gtgcccagga 
acgtctggtg 
agcgctacga 
agtgtcggca 
gcggcttccc 
gccgcagcac 
acgtgcctgg 
tgctccaccc 
agggtggcct 
aggaagagga 
cagatacaaa 
ccgagatatg 
tgggtcccat 
ttggcactga 
cctctagggc 
ggcggccttc 
gtagcggctt 
ggcatggtac 
agtccatgtc 
gccaggcatc 
tcagcaaaaa 
ttccctatct 
gtctaccgtt 
tagctctcac 
gagaaggaat 
tgacggaata 



ggtagggggg 
catgtctcag 
tgggaaggta 
gggagtgggg 
cgatctggat 
ggcgacctcc 
cgaccttatc 
gcgccgcgtg 
gccagtgcca 
caacttccgg 
cgtgtatcca 
ccacactcac 
gacatcctgt 
cagcaaccct 
cctggaagac 
ggaggtggag 
ccagacacgg 
aatgaccttg 
tttcagcgcc 
gccacaaaga 
ctaggggaca 
ctgagagaat 
cggtagtggc 
caccagcctc 
ctctcaccag 
ccctggcccc 
aataggagcc 
ccttaccaaa 
cattccttta 
cctctgccct 
ctggactccc 
aaagcttgct 



cggctgagcc 
gaaggtgtgg 
tcctccaaga 
cgctccatcg 
gcccaggatc 
ctcctgccac 
cctgctgtgg 
ccccagcgga 
gctgacaaga 
ggctgggcac 
acacggatcc 
tcctgctcag 
gctcacccca 
cacattagca 
ctggacaacc 
gacgaggagg 
tctgtggcta 
gggagccatc 
cagggtcaca 
aggtgtggcc 
gaggaacaca 
atgccccacc 
cgccgtggtg 
cccgctggta 
aataaaagcc 
tcccattctt 
ctggcccccc 
gtacaagtca 
tcaaagaaaa 
ttacttgaac 
catcccccca 
tatccttaaa 



gcgcagtgcg 
agctggagaa 
aggcgggggc 
tgtacggggt 
caccagaaga 
tgacagaagc 
accggcagtt 
aggaggtatc 
tgtggggcct 
agatggggtc 
tccccccgcg 
ctgccgactg 
agagccggaa 
gtggctccac 
tcatcctgga 
gcgggcccag 
ctttgtgtta 
tgaggccaag 
gatccacagt 
agaacaactt 
gagtcacagc 
acgaaactca 
ccacacaccg 
cagggcacag 
tctacctgca 
attgaataca 
aactttcttc 
catctttccc 
gtatctactt 
aggagtcttg 
ccaggataaa 
aaaaaaa 



gaccctcgcg 
gagcgtccgg 
tctgatgagg 
aaagcaaaaa 
tatgaagcag 
caacctacgc 
tgcctgctcc 
ccggtgccgg 
ggctgagttc 
cccgtccccc 
ctgggaccgg 
ctacaaccgg 
gcagaaccac 
tgtggccacc 
ggacctgaag 
ggagtgaccc 
ttataagata 
atattgacgg 
gggaagttct 
gggctcctgc 
ttcaggggcc 
gcccagtaga 
ttgaggttgg 
ttacctgagg 
cctcacagtg 
agccctgatc 
agagtaatag 
accttttctg 
cctttctaga 
attctttttt 
agtcctgacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1907 



<210> 41 
<211> 3689 
<212> DNA 



14/52 



WO 01/36684 



PCT/US00/31743 



<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 375724.9 

<400> 41 

tgggtccccc aggagagcct ctaaggtcac 
gcatggtgcc tcctccctga caaccacctc 
ggcaccagca tggatctctg cctgctgccg 
atcagttctt tataccaagt cctttgtgaa 
tgtccacctg gcttccaaga tagaagaggc 
gtttcaccgc cttcgacagc tgagagacaa 
gagtgtgagc gtaaccaaca cctggtccag 
ctaagacttc tcctctgctg tccaagcgct 
atccagtgca ggctctccac cgaaggctgg 
gacagttggc agcggctctg atgagcccga 
cctccgcacg atgctcccac gcgtccaact 
tggactcctt caaggaagct gcagctcggt 
tggaggaaga ggaagaggaa cccaagagga 
tgtgctcagc cagttcattt gagtttgcat 
agatttcttt aatcaaaagc gttttagtga 
gtgctttgca tgagagcagg ccaaggttga 
ttcatagcaa cacgtattgt ctgacattca 
gcctttctgt cctgtattta ctgtatttag 
tcttgagatt cacaggactg tctagctctg 
gtacgtttat ctagattttc tctagataat 
ggcattgcag ttcttcctgt gtttggcttc 
cgacgtcttc gtggcggttc cagccagaga 
cccggacgct ggagatccct ttgcccaatc 
ctgaagaaga aattcaggaa atctgcttaa 
ttgatctcac acacctggag ggtgaagtgg 
aaggcccaag cccggggcct gttgcctggg 
ttctctcctg cccccaagct ggtggaatcc 
ccactgtctg tgaagaacac caagaggagg 
agccccgtga acggcttgcc aaaggggcga 
cagagctact cgaggtcccc atcccgatca 
cggctccaca tctggtgggt ccaagttcgc 
caccgagaca ggccccccgc agcgctccct 
agtccaagga ctgcaagtac ccccagaagc 
cgttctcgaa gcaggtcact gggagcgggc 
tcattactac agagatcagc gacgagagcg 
ctatgagcgg gaccaccctg ggcacagcag 
tggtggccgc aagcccttcc ctggggagta 
tgccctttga cccccgggtg tgctctcagc 
gaaatgtctg tcgactggac cttggtggat 
gtgcttcaga ccggtctggg gtgcggcgca 
agcagctctg agggcagctc aatgaaaaag 
ggcatggcct ggtgctatcg gcagccgctc 
aagccaagaa agatgatttt tagaaccttt 
tttgcagtgt ttcacaggag aaagtggcct 
aaataaacat gtgtttaata caagttaaag 
gtcagcttaa aacttgtgta gggaatcctg 
tgttgaaagt cgtatatttt tatctgtgcg 
agaggttccc tatatgcgcg tatgacatgg 
atccggagcg ttatggacgt ggtaggtggt 
gtgtgcacag ccagtgcggc ccacttccgg 
ggattttgca tccaatcctg tgtgcctgcc 
cctctgccca tcaggatgac aggctccttc 
acacagaatg attccaaggt gctctcgttc 
tctgtgggcg agagaggcag ctgggagcag 
cacctgcact ctgagtgccc cccactgtca 
ggttttgggg actagggggc cccactcctg 
ggacatatag gccggggcta gtgggataac 
gaatcctgca ttagatagct ggtgggaccc 
ccagggaggg tgtccttcca gggagagcag 
ggaagtaaga gaaccagtaa agggccatac 



acagggtgcc cactgcagac aggctatagt 60 
cacttcacac cagccacagc aaggaacttt 120 
atggcatgac tgtcgaacag gtggtgttcc 180 
gcattccaca gagcatgtgt caatggcctg 240 
cccaagacgc atacgggacg tcatcaatgt 300 
aaaataatcg ttatgtacct tcaggtgtta 360 
acctcatggg tagcctctga gggtaagtga 420 
ttggtgcagg gacagcggca tcttcagcca 480 
ctctagactg gtgacccctt gttgaaatgg 540 
gaagaggcct gcccttgggt gcggagtctc 600 
tgcacccaag gggcttttcc ctcttccaag 660 
cagcagagaa ggggcctgcc gccagcgccc 720 
tggcttgtct cccagcagcc acaccggctt 780 
gtttctctgc actatggatt ttgagcattt 840 
ctccagtaga cattttcttt ctgaggcatc 900 
ggggaaaagt aaagttaaag tcggttctct 960 
gccagctttt ttttttctaa taatttctgt 1020 
aaaaagcagc tagaatattt ctccattaac 1080 
agtcctagca atagactcct tagaggagta 1140 
gcaggcggaa gacctgggtt ccccgggtgg 1200 
caggaattac atgaacgaca gccttcgcac 12 60 
gcatcgcctg tgcctgcatt tatcttgctg 1320 
gtccccattg gtttcttttg tttggagcaa 13 80 
agatcttgca gctttatgct cggaaaaagg 1440 
aaaaaagaaa gcacgctatc gaagagggca 1500 
ggcacacagg tgctggatgg tacctcgggg 1560 
cccaaagaag gtaaagggag caagccttcc 1620 
ctggagggcg ccaagaaagc caaggcggac 1680 
gagagtcgga gtcggagccg gagccgtgag 1740 
gcgtctgcct aagaggagga aaagtgacag 1800 
agagccgctc ccggagcagg agtgactccc 1860 
acaaaggctc tgagattcgg ggctcccgga 1920 
cacacaagtc tcggagcccg gagttcttcc 1980 
ggataatccg ggaaaataca agaagaaaag 2040 
ctcgaggtcg tatgaacgca caggccgtcg 2100 
gcatcggagg tgaggcgggg ttgcagtgac 2160 
cctgatggct gccctttgac ccccggtggc 2220 
gcaagtggtc ctagaacagg attctttttg 2280 
ttggaaatgg aactgaggga ccggtgacac 2340 
cacctgggcc cgtgcagggc tcagctcggc 2400 
tgaatgcaca cgcccttgtt ggcgtggcct 2460 
tccactcccc gactgatact caattacgtg 2520 
gcctatatta ggttgtactt atgtacatat 2580 
taactgcccc ttattctctc tccacgttgt 2640 
ctatgtatga aaactcagaa cttgaatccc 2700 
acttttaaaa tgtgagggta tttggatctg 2760 
gtgctgagtg caggccacca gctcctaaat 2820 
tgaataaaca caactctctc cactcaggac 2880 
cgttctgtgt gcttgtgaaa gtgtccaggc 2940 
gctccttgct ccctgctgta ctgaagtttt 3000 
cttctgccga agcttgtgag gggcctgagt 3060 
ctgcagggcc ataggaggga agttttggaa 3120 
ctgaggggga ctggtttgta acccatgaca 3180 
gacacttgga gggtcacccc acgggggtgg 3240 
tcagctgcct cttaccgtgg acacagtttt 33 00 
gtggtaccgt ttggacttac tagggcagtg 33 60 
ggggagttac gcctgatgac ttttttgatg 3420 
ccccctcaga attggggaac tgaggagact 3480 
ctatgagggg ccccctagct tcctgtgcct 3 540 
acacctgtac ccaagagacc gctctccatt 3 600 



15/52 
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tgctttcttt ttttactaaa taattgtaaa atattattat gacataaaga accatttaag 3660 
gccanaaana anaagactna naaaaaaag 3 689 



<210> 42 

<211> 408 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 375724.3 

<400> 42 

tttgtattaa acacatgttt atttacaacg 
ctttctcctg tgaaacactg caaaatatgt 
tctaaaaatc atctttcttg gcttcacgta 
ctgccgatag caccaggcca tgcaggccac 
attgagctgc cctcagagct gctgccgagc 
ccagaagacg tcggtgcgaa ggctgtcgtt 
ttggttacgc tcacactcta acacctgaag 



tggagagaga ataaggggca gttaaggcca 60 

acataagtac aacctaatat aggcaaaggt 120 

attgagtatc agtcggggag tggagagcgg 180 

gctaacaagg gcgtgtgcat tcactttttc 240 

tgagccctgc acgggcccag gtgtgcgccg 300 

catgtaattc catgaggtct ggaccaggtg 3 60 

gtacataacg attatctt 408 



<210> 43 
<211> 3136 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No. 

<400> 43 



1867333 



cgacgccggc 
cctctgcaaa 
tgtcaaacgg 
acaagctttt 
ctttgggggc 
ggaacaacat 
tgacacagga 
ggacc agate 
ggtactggaa 
cattcagggc 
gctggttaag 
gcccgtggtt 
agcatccacc 
ggcagtactc 
catgcacgcc 
cagtgaaatt 
aaaggecact 
gaagaagggg 
gttcaacacc 
gcaactgggg 
caccaaggaa 
ggaccaggcg 
tcttctcttc 
ccggtccacc 
ggagctgaag 
ctctatgtca 
tgeaggatec 
ggctgaccat 
ccccatcccc 
ectgeaaggt 
taagaatctt 
agtcagagga 
tgcattagtt 



gtgatgtggc 
gtttacttgg 
cccccagcgc 
tcagccaacc 
ctggctgcag 
accaaggcag 
atccattaca 
actgaagggc 
gggcccaatg 
ctcaaggaga 
gtggtatcca 
cagctcctcg 
cagagectgg 
agctacatct 
ctgctggtca 
gccttccaca 
gtgcagagtg 
catgagctgg 
tatgaacacc 
aeggtgegge 
gacctgcatc 
atggagcgct 
ttcgctttcc 
atgatcatgc 
ggaaagcggg 
gtggtcctga 
ccactcacca 
gacctgggcc 
aacctctatc 
gccctgctgt 
gattctagga 
atttgeccaa 
ccttgcacgt 



ttccgctggt 
gactattctc 
ccc tggtaac 
aagtgccgga 
ctgeaattet 

ggggctgctg 

ttgggcgtat 
agctggactg 
gecgaaagga 
agtttccaca 
gtggagcccc 
acaggtgtgg 
ctgaggtcct 
tccccactta 
accactacat 
ccatccctgt 
tgttgctgga 
tgaacatcta 
tactgeeggg 
ceggcttagg 
tgccgtccac 
acgtctccat 
catcagccaa 
tcatacccac 
gcagtgacta 
aactgttccc 
accagttcta 
gcctgcaccc 
tgacaggeca 
gcagcagcgc 
tccgggcaca 
tggctggggc 
ataaagcact 



gctgctcctg 
tggcagctcc 
tgacaaggag 
gaagctggat 
agctaaagct 
tcataccttt 
ggaagagggc 
ggctcccctg 
gtaccccatg 
ggaggaagct 
tcatgccatc 
getgetgact 
gcagcagctg 
cggtgtcacc 
gaaaggaggc 
gattcagegg 
ctcagctggg 
ttgccccatc 
gaacgcccgc 
catgacctct 
caactactat 
geccagggaa 
agatccgacc 
tgectacgag 
tgagaccttc 
acagctggag 
tetggctget 
ttgtgtgatg 
ggatatcttc 
catcctgaag 
gaagaaaaag 
atctcccttg 
ctaatttggt 



gctgtgctgc 
ccgaatcctt 
gecaggaaga 
gtggtggtaa 
ggcaagegag 
ggaaagaatg 
ageattggee 
tcctctcctt 
tacagtggag 
atcattgaca 
ctgttgaaat 
cgtttctctc 
ggggcctcct 
cccaaccaca 
ttttatcccc 
gctgggggcg 
aaagcctgtg 
gtggtctcca 
tgcctgccag 
gttttcatct 
gtttactatg 
gaggctgegg 
tgggaggacc 
tggtttgagg 
aaaaactcct 
gggaaggtgg 
ccccgaggtg 
gcctccttga 
acctgtggac 
eggaacttgt 
aattagttcc 
acttacccat 
tetgatgect 



tgctggccgt 
tctccgaaga 
aggttctcaa 
ttggcagtgg 
tcctggtgct 
gecttgaatt 
gttttatctt 
ttgacatcat 
agaaagecta 
agtatataaa 
tcctcccatt 
cattccttca 
ctgagctcca 
gtgectttte 
gagggggttc 
ctgtcctcac 
gtgtcagtgt 
aegcaggact 
gtgtgaagca 
gectgegagg 
acaeggacat 
aacacatccc 
gattcccagg 
agtggcaggc 
ttgtggaagc 
agagtgtgac 
cctgctacgg 
gggeccagag 
tggtegggge 
actcagacct 
atcagggagg 
aatgtctttc 
gaagagaggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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ctagtttaaa 
gtgagatctt 
gcttgatgtc 
atcaaagcga 
tgggttcagt 
ctacactgtc 
tggttgccat 
cttcagggga 
gcaggatcca 
cccgggtctg 
tcagtacctg 
gaagggtgac 
gcaggctgca 
taagggaggg 
tggcatttgc 
ggtttggaaa 
gtagttcgtg 
tttattttgg 
tcagtaaaac 
aattataaaa 



tcacaattcc 
tacgcctttt 
tcatgacgag 
atattccatc 
tctgtcctga 
aagggaaaag 
tccataggtt 
caggaaatgc 
tcagttagta 
tcttccttat 
agcctcaatc 
atcaggagtc 
gggcattcca 
aagtcacatc 
agttaattag 
aatgactttt 
ttaatgtaac 
ttttttgttc 
actatatact 
aaaaaa 



gaatctgggg 
ataacatgcc 
cggcgctctg 
tgtggataga 
ggcttctgct 
ggagactaat 
tggagagc tc 
ctgtgtctgg 
gggtgcatgt 
catcggggtg 
aagccttatc 
agggcatgga 
gccaagggca 
agaaaaggga 
cacatgtgag 
cagttatgtc 
attaataaat 
aagaaaacaa 
gaataacacc 



caatggaatc 
atccctacta 
catccctcac 
acccctggca 
ctcattcatt 
gaggcttaac 
tagatctctt 
ccagtgtggt 
cagatgatca 
gcagctggtt 
caccaaatac 
ctggtaagat 
cagcagggga 
aagccacgga 
ggttagacag 
tttggtatca 
ttattgattc 
aactattatt 
aaaataagct 



actgcttcca 
ataggatatt 
ccatgcctcc 
gtgttgtcag 
tagtgctacg 
tcaaaacctg 
ttgtgctggg 
tctggagctt 
tatccaattc 
ctcaatgtgc 
acagggaagg 
gaatactttg 
cagtgcaggg 
atgtgtgtga 
gtaggtgaat 
gacatacgaa 
cattgcttta 
gtgatggcat 
ttaaaaaaat 



gctggggcag 
gacttggata 
taactcagtg 
ctcaacctgg 
ctgcacagtt 
ggcatggttt 
ttcagtggct 

tggggtaaca 

atatggaagt 
cagcagggac 
gtgatgcagg 
ctgggctgaa 
aggtgtgggg 
agcccagaaa 
gcaagctcaa 
aggtctcttt 
acatttgaaa 
ttgcagaagc 
aaaattaagt 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3136 



<210> 44 
<211> 1376 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 1461451 

<400> 44 

ccacgcgtcc gcggacggtg ggtcgcccac gcgtccgccc acgcgtccgc ccacgcgtcc 60 

gatgagatcc cggcctcagg gtggacgcag tggttctgca ctgaggccct cgtcatggtg 120 

gcgcctgtgt ggtacttggt agcggcggct ctgctagtcg gctttatcct cttcctgact 180 

cgcagccggg gccgggcggc atcagccggc caagagccac tgcacaatga ggagctggca 240 

ggagcaggcc gggtggccca gcctgggccc ctggagcctg aggagccgag agctggaggc 3 00 

aggcctcggc gccggaggga cctgggcagc cgcctacagg cccagcgtcg agcccagcgg 3 60 

gtggcctggg cagaagcaga tgagaacgag gaggaagctg tcatcctagc ccaggaggag 420 

gaaggtgtcg agaagccagc ggaaactcac ctgtcgggga aaattggagc taagaaactg 480 

cggaagctgg aggagaaaca agcgcgaaag gcccagcgtg aggcagagga ggctgaacgt 540 

gaggagcgga aacgactcga gtcccagcgc gaagctgagt ggaagaagga ggaggagcgg 600 

cttcgcctgg aggaggagca gaaggaggag gaggagagga aggcccgcga ggagcaggcc 660 

cagcgggagc atgaggagta cctgaaactg aaggaggcct ttgtggtgga ggaggaaggc 720 

gtaggagaga ccatgactga ggaacagtcc cagagcttcc tgacagagtt catcaactac 780 

atcaagcagt ccaaggttgt gctcttggaa gacctggctt cccaggtggg cctacgcact 840 

caggacacca taaatcgcat ccaggacctg ctggctgagg ggactataac aggtgtgatt 9 00 

gacgaccggg gcaagttcat ctacataacc ccagaggaac tggccgccgt ggccaacttc 9 60 

atccgacagc ggggccgggt gtccatcgcc gagcttgccc aagccagcaa ctccctcatc 1020 

gcctggggcc gggagtcccc tgcccaagcc ccagcctgac cccagtcctt ccctcttgga 1080 

ctcagagttg gtgtggccta cctggctata catcttcatc cctccccacc atcctgggga 1140 

agtgatggtg tggccaggca gttatagatt aaaggcctgt gagtactgct gagcttggtg 1200 

tggcttggtg tggcagaagg cctggcctag gatcctagat aagcaggtga aatttaggct 1260 

tcagaatata tccgagaggt ggggagggtc ccttggaagc tggtgaagtc ctgttcttat 1320 

tatgaatcca ttcattcaag aaaatagcct gttgcacatt taaaaaaaaa aaaaaa 1376 



<210> 45 
<211> 649 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 2345712 

<400> 45 
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ctacgacccg 
acccacgagg 
accagattca 
gaggtcccgg 
taagatggtc 
cgtctcaggc 
caaactcttc 
cttggcttca 
gctgttcctg 
agctgccatg 
aggcagccac 



attggcttcg 
ggacgcggcc 
gcccatttgg 
ttcctaacgg 
catctactgg 
ttcctgcttt 
cccttctact 
cagcatgctt 
agccttacgc 
tgggccctgc 
cagggttccg 



ggctcagctg 
gaggagggtc 
ccccgacgcc 
actgcaagat 
tcttgtcagg 
tccgaagcct 
tccacatctc 
gggctcagct 
tggccactgt 
aaaccgtgga 
atccctaccg 



ggaggcggga 
gctgtccacc 
tctgttctcg 
ggaggaaggc 
tgcctggggc 
tccccgacat 
catgggctgt 
cacattctgg 
caacgcccgc 
gaaggagcga' 
ccagctgcga 



cgaattattg 
cgggggcgtg 
gaatccgggt 
gggaacctag 
atgcaaatgt 
accttcggac 
gccttcatca 
gaggccagcc 
tggctggaac 
ggcctgggtg 
gagaaggac 



gttgggggaa 60 
ggagtgaggt 120 
gctgcggatt 180 
gaggcctgat 240 
gggtgacctt 3 00 
tagtgcagag 3 60 
acctctgcat 42 0 
agctttacct 480 
cccgcaccac 540 
gggaggtacc 600 
649 



<210> 46 
<211> 1554 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 1810320 



<400> 46 



ctcccggttc 
gcccgttgtc 
gggggacgac 
agtggccttc 
gcggcacggc 
caggaagctg 
gcccccagct 
tctcattaac 
cttcaagacc 
tgagaagttc 
ccgggggcag 
gcggcacttg 
ccccatcagt 
caaggtcact 
gctctacctg 
cggggcatgg 
tgctaagctc 
agcaccaatc 
actgcgcttt 
ggcccgtgga 
caggtgccac 
tcaagggtgt 
tgtgttctct 
atggtcctcc 
acacactttg 
aataaattgg 



caggcgagtt 
cccgcagtcc 
tgtctccccg 
atcacaggag 
tgccatacgg 
gctggggcca 
gtcatggccg 
tgtgcggccg 
gtgatggaca 
ttccgggacc 
gcgctccagg 
gctgtggagt 
ggcacagagg 
gccagcccgc 
gccagccctc 
ttgacgttcc 
taggaatctt 
tgaaccagca 
ccctccccac 
gctgccacgc 
gcaggtggtg 
cctgcctgcc 
tggctgaaaa 
agctgcccac 
ctacaatctt 
actttgcaaa 



cgcagctgcg 
ccgacgggag 
cgtaccgcca 
gcggctctgg 
tgattgccag 
ccggccggcg 
ccgtggacca 
ggaacttcct 
tcgataccag 
acggaggggt 
tgcatgcagg 
ggggtcccca 
ggctccggcg 
tgcagaggct 
tggcttccta 
caaacggtgt 
ccggccgctg 
atgcctgcag 
ggccccaact 
aggtgcctga 
ggggtacaga 
tgggtccagg 
cactgaggtg 
aaggtccgcc 
gctagtgcgt 
agcttttaga 



cgccgggtcc 
cgccatggcc 
cctcttctgc 
gattgggttc 
taggagcctg 
ctgcctccct 
ggctctgaag 
gtgccccgct 
cggcaccttc 
gatcgtgaac 
ctccgccaag 
aaacatccgc 
actgggtggc 
ggggaacaag 
cgtgacgggg 
caaagggctg 
cttcctgccg 
cccagcccct 
ccagggcagg 
gggccaggtg 
caagatgctg 
gcctgaggga 
ctcccatctg 
cctctgtctc 
tttcttaaaa 
aggaaaagaa 



tggaggccga 
cagccgccgc 
ccggacctgc 
cggattgctg 
ccgcgagtgc 
ctctctatgg 
gagtttggca 
ggcgccttgt 
aatgtgtctc 
atcactgcca 
gccgctgtgg 
gtcaacagcc 
cctcaggcca 
accgagatcg 
gccgtgctgg 
ccggatttcg 
cctcactcag 
cctctgaaca 
agcaactgga 
ccacgcaggt 
ggatgtcccc 
gccacatgga 
tgcgtggccc 
tgcaccacct 
gataatctat 
agaggattaa 



ggccgctccc 
ccgacgtgga 
tgcgggacaa 
agattttcat 
tgacggccgc 
acgtccgagc 
gaatcgacat 
ccttcaacgc 
gtgtgctcta 
ccctggggaa 
acgcgatgac 
tcgcccctgg 
gcctgagcac 
cccacagcgt 
tggccgatgg 
catccttctc 
ccaggtggag 
ctcagctatt 
cagtgggcct 
gtctgaggac 
tgccccatgg 
tcccgagact 
atgagctggg 
gtttgcataa 
ttactgtaaa 
aggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1554 



<210> 47 
<211> 1083 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 964996 



<400> 47 



gagccgtcag tcttacaaag tcgtgactgg 
tgcagcacat gcctctgacc gccttcggca 
gggctcctct ggaaaccagg gacccgatgc 
ggactatagg aaggagggcc ctcctggacc 
gatggcgtgc aacggcggct cgttggggac 



caaaacctgg cgttaccaac ttaatcgcct 60 
cgtccagatt ctgtgggaca tacagggtct 120 
cggagggtag cttggctctg gagcagcctg 180 
cgggagcgga ccctggtggc ggtgaagccc 240 
gtgatccagc gctttgagag gcggggcttc 3 00 
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acgctggtgg 
gacctgcgga 
gtggccatgg 
accgactcgg 
aggaatgtca 
ttccagagca 
gcctgaggct 
agcaagaacc 
cctctgcccc 
ccaagccagc 
gtgggggggg 
ctgtgcccag 
gacgccttta 
aaa 



ggatgaagat 
ggaagccctt 
tctgggaagg 
ctgaggctgc 
tccacgccag 
gtgagctggt 
caagctgccc 
caagcccaca 
accccagccc 
acaagattgg 
gtcttcacat 
cacatgggtg 
tgcaagttgt 



gctgcaggca 
ctaccctgcc 
gtacaatgtc 
cccaggaacc 
cgactccgtg 
gagctgggca 
ttaccacccc 
tccaaacctg 
agaggagttt 
accaatcctt 
tatcataacc 
gtacactaat 
aaggagttga 



ccagagagcg 
ctcatccgct 
gtccgcgcct 
ataaggggtt 
gagggggccc 
gatgggggcc 
atcccccacg 
cctgtcccaa 
gagccaccaa 
tttgcaccaa 
tctcctctaa 
tatgacttcc 
acagtaaaga 



tccttgccga 
acatgagctc 
cgagggccat 
acttcagcgt 
agcgggagat 
agcacagcag 
caggaccaac 
accacttact 
cttcagtgcc 
agtgccggac 
aggggaggca 
cccagctctg 
ggaagttttg 



gcactaccag 
tgggcctgtg 
gattggacac 
ccacatcagc 
ccagctgtgg 
catccaccca 
tacctccgtc 
tccctgttca 
tttctgtacc 
aacctttgtg 
ttaaaattca 
aggtagaaat 
cacaaaaaaa 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1083 



<210> 48 










<211> 1512 










<212> PRT 










<213> Homo sapiens 






<220> 












<221> misc_f eature 






<223> Incyte ID 


No. : 


: 230272: 


<400> 48 










Met Ala 


Thr 


Leu 


Ser 


Leu 


Thr 








5 






Gly Ala 


Leu 


Leu 


Ala 


Val 


Glu 






20 






Ser Val 


Glu 


Glu 


Gly 


Lys 


Glu 








35 






Val He 


Phe 


Thr 


Asp 


Val 


Asn 








50 






Val Ala 


Thr 


Thr 


Ala 


Gly 


Leu 








65 






Thr Glu 


He 


Asp 


His 


Trp 


Leu 








80 






Ser Cys 


Asp 


Ser 


Phe 


Thr 


Ser 








95 






Leu Ser 


Leu 


Arg 


Thr 


Tyr 


Leu 








110 






Asp Leu 


Cys 


Val 


Trp 


Ala 


Thr 








125 






Glu Gin 


Leu 


Lys 


Gin 


Lys 


Lys 








140 






Phe Gly 


Phe 


Leu 


Glu 


Ala 


Gin 








155 






Lys Trp 


Asp 


Val 


Ser 


Thr 


Thr 








170 






Lys Gin 


Asp 


Val 


Gly 


Lys 


Phe 








185 






Gly Lys 


Val 


Thr 


Val 


Arg 


Phe 








200 






His He 


Gly 


His 


Ala 


Lys 


Ala 








215 






Val Asn 


Phe 


Lys 


Gly 


Lys 


Leu 








230 






Pro Glu 


Lys 


Glu 


Lys 


Glu 


Asp 








245 






Val Ala 


Met 


Leu 


His 


He 


Lys 








260 






Asp His 


Phe 


Glu 


Thr 


He 


Met 








275 






Glu Gly 


Lys 


Ala 


Tyr 


Val 


Asp 








290 






Ala Glu 


Arg 


Glu 


Gin 


Arg 


He 



Val Asn 
His Val 
Asn He 
Ser He 
Tyr Gly 
Glu Phe 
Thr He 
Val Gly 
Leu Lys 
Ala Pro 
Gin Ala 
Lys Ala 
Val Glu 
Pro Pro 
Ala Leu 
He Met 
Phe Glu 
Pro Asp 
Lys Tyr 
Asp Thr 
Glu Ser 



Ser Gly 
10 

Lys Asp 

25 
Leu His 

40 
Leu Arg 

55 
Ser Asn 

70 
Ser Ala 

85 

Asn Glu 
100 

Asn Ser 
115 

Gly Asn 
130 

Val His 
145 

Phe Gin 
160 

Arg Val 
175 

Leu Pro 
190 

Glu Ala 
205 

Leu Asn 
220 

Arg Phe 
235 

Lys Val 
250 

Gin Phe 
265 

Ala Glu 
280 

Pro Ala 
295 

Lys His 



Asp Pro 
Asp Val 
Val Ser 
Tyr Leu 
Leu Met 
Thr Lys 
Leu Asn 
Leu Ser 
Ala Ala 
Val Lys 
Ser Val 
Ala Pro 
Gly Ala 
Ser Gly 
Gin His 
Asp Asp 
He Leu 
Thr Tyr 
Lys Leu 
Glu Gin 
Arg Lys 



Pro Leu 
15 

Ser He 
30 

Glu Asn 
45 

Ala Arg 
60 

Glu His 
75 

Leu Ser 
90 

His Cys 

105 
Leu Ala 

120 
Trp Gin 

135 
Arg Trp 

150 
Gly Thr 

165 
Glu Lys 

180 
Glu Met 

195 
Tyr Leu 

210 
Tyr Gin 

225 
Thr Asn 

240 
Glu Asp 

255 
Thr Ser 

270 
He Gin 

285 
Met Lys 

300 
Asn Pro 
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305 








He 


Glu 


Lys 


Asn 


Leu 
320 


Gin 


Met 


Trp 


Gin 


Phe 


Gly 


Gin 


Ser 
335 


Cys 


Cys 


Leu 


Ser 


Asn 


Asn 


Gly 


Cys 
350 


Met 


Arg 


Asp 


lie 


Gin 


Pro 


His 


Pro 
365 


Arg 


Thr 


Gly 


Thr 


Tyr 


Asp 


Phe 


Ala 
380 


Cys 


Pro 


He 


Thr 


His 


Ala 


Leu 


Arg 
395 


Thr 


Thr 


Glu 


Phe 


Tyr 


Trp 


He 


He 
410 


Glu 


Ala 


Leu 


Trp 


Glu 


Tyr 


Ser 


Arg 
425 


Leu 


Asn 


Leu 


Arg 


Lys 


Leu 


Thr 


Trp 
440 


Phe 


Val 


Asn 


Asp 


Asp 


Pro 


Arg 


Phe 
455 


Pro 


Thr 


Val 


Met 


Thr 


Val 


Glu 


Gly 
470 


Leu 


Lys 


Gin 


Ser 


Arg 


Ser 


Val 


Val 
485 


Asn 


Met 


Glu 


Asn 


Lys 


Lys 


Val 


He 
500 


Asp 


Pro 


Val 


Leu 


Lys 


Lys 


Glu 


Val 
515 


He 


Pro 


Val 


Glu 


Met 


Lys 


Glu 


Val 
530 


Ala 


Lys 


His 


Leu 


Lys 


Pro 


Val 


Trp 
545 


Tyr 


Ser 


Pro 


Asp 


Ala 


Glu 


Thr 


Phe 
560 


Ser 


Glu 


Gly 


Trp 


Gly 


Asn 


Leu 


Asn 
575 


He 


Thr 


Lys 


Lys 


He 


He 


Ser 


Leu 
590 


Asp 


Ala 


Lys 


Tyr 


Lys 


Lys 


Thr 


Thr 
605 


Lys 


Val 


Thr 


Ala 


Leu 


Pro 


He 


Pro 
620 


Val 


He 


Cys 


Thr 


Lys 


Pro 


Val 


Leu 
635 


Gly 


Lys 


Asp 


Asn 


Lys 


Asn 


Ser 


Lys 
650 


His 


Glu 


Glu 


Leu 


Lys 


Asp 


Leu 


Lys 
665 


Lys 


Gly 


Asp 


Gly 


Phe 


Phe 


He 


Cys 
680 


Asp 


Gin 


Pro 


Ser 


Cys 


Lys 


Glu 


Ala 
695 


Pro 


Cys 


Val 


His 


Thr 


Lys 


Glu 


Met 
710 


Pro 


Thr 


Ser 


Val 


Glu 


Ala 


Thr 


Lys 
725 


Asn 


Glu 


Thr 


Pro 


Thr 


Pro 


Ser 


Leu 
740 


Asn 


Asn 


Asn 


Leu 


Val 


Leu 


Tyr 


Asn 
755 


Arg 


Val 


Ala 


Glu 


Leu 


Lys 


Ala 


Lys 
770 


Lys 


Ala 


Pro 


Val 


Lys 


Gin 


Leu 


Leu 
785 


Ser 


Leu 


Lys 


Gly 


Gin 


Glu 


Tyr 


Lys 
800 


Pro 


Gly 


Asn 


Asn 


He 


Ser 


Ser 


Asn 


Ser 


Ser 


Ala 



310 315 



Glu 


Glu 


Met 


Lys 


Lys 


Gly 


Ser 




325 










330 


Arg 


Ala 


Lys 


He 


Asp 


Met 


Ser 




340 










345 


Pro 


Thr 


Leu 


Tyr 


Arg 


Cys 


Lys 




355 










360 


Asn 


Lys 


Tyr 


Asn 


Val 


Tyr 


Pro 




370 










375 


Val 


Asp 


Ser 


He 


Glu 


Gly 


Val 




385 










390 


Tyr 


His 


Asp 


Arg 


Asp 


Glu 


Gin 




400 










405 


Gly 


He 


Arg 


Lys 


Pro 


Tyr 


He 




415 










420 


Asn 


Asn 


Thr 


Val 


Leu 


Ser 


Lys 




430 










435 


GlU 


Gly 


Leu 


Val 


Asp 


Gly 


Trp 




445 










450 


Arg 


Gly 


Val 


Leu 


Arg 


Arg 


Gly 




460 










465 


Phe 


He 


Ala 


Ala 


Gin 


Gly 


Ser 




475 










480 


Trp 


Asp 


Lys 


He 


Trp 


Ala 


Phe 




490 










495 


Ala 


Pro 


Arg 


Tyr 


Val 


Ala 


Leu 




505 










510 


Asn 


Val 


Pro 


Glu 


Ala 


Gin 


Glu 




520 










525 


Pro 


Lys 


Asn 


Pro 


Glu 


Val 


Gly 




535 










540 


Lys 


Val 


Phe 


He 


Glu 


Gly 


Ala 




550 










555 


Glu 


Met 


Val 


Thr 


Phe 


He 


Asn 




565 










570 


He 


His 


Lys 


Asn 


Ala 


Asp 


Gly 




580 










585 


Leu 


Asn 


Leu 


Glu 


Asn 


Lys 


Asp 




595 










600 


Trp 


Leu 


Ala 


Glu 


Thr 


Thr 


His 


610 










615 


Val 


Thr 


Tyr 


GlU 


His 


Leu 


He 




625 










630 


Glu 


Asp 


Phe 


Lys 


Gin 


Tyr 


Val 




640 










645 


Leu 


Met 


Leu 


Gly 


Asp 


Pro 


Cys 




655 










660 


He 


He 


Gin 


Leu 


Gin 


Arg 


Arg 




670 










675 


Tyr 


Glu 


Pro 


Val 


Ser 


Pro 


Tyr 




685 










690 


Leu 


He 


Tyr 


He 


Pro 


Asp 


Gly 




700 










705 


Gly 


Ser 


Lys 


Glu 


Lys 


Thr 


Lys 




715 










720 


Ser 


Ala 


Pro 


Phe 


Lys 


Glu 


Arg 




730 










735 


Cys 


Thr 


Thr 


Ser 


Glu 


Asp 


Ser 




745 










750 


Val 


Gin 


Gly 


Asp 


Val 


Val 


Arg 




760 










765 


Lys 


Glu 


Asp 


Val 


Asp 


Ala 


Ala 




775 










780 


Ala 


Glu 


Tyr 


Lys 


Glu 


Lys 


Thr 




790 










795 


Pro 


Pro 


Ala 


Glu 


He 


Gly 


Gin 




805 










810 


Ser 


He 


Leu 


Glu 


Ser 


Lys 


Ser 
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815 



Leu 


Tyr 


Asp 


Glu Val 
830 


Lys 


Ala 


Glu 


Lys Ser 
845 


Cys 


Leu 


Leu 


Ser Leu 
860 


Glu 


Tyr 


He 


Pro Gly 
875 


Ser 


Pro 


Thr 


Arg Asn 
890 


Ala 


Lys 


Val 


Leu Phe 
905 


Arg 


Lys 


Leu 


Lys Thr 
920 


Ala 


Val 


Gin 


Glu Leu 
935 


lie 


Gly 


Val 


Glu Tyr 
950 


Lys 


Asp 


Lys 


Lys Lys 
965 


Asn 


Lys 


Pro 


Gin Lys 
980 


Lys 


Asn 


Gin 


Gly Gly 
995 


Gin 


Gly 


Pro 


Lys Lys 
1010 


Glu 


Glu 


Asn 


Leu Ala 
1025 


Glu 


Met 


He 


Glu Tyr 
1040 


Pro 


Trp 


Ala 


Tyr Ala 
1055 


Ala 


Glu 


He 


Lys Lys 
1070 


Phe 


Val 


Ser 


Gin Ser 
1085 


Asp 


Phe 


Ala 


Pro Glu 
1100 


Glu 


Leu 


Ala 


Glu Pro 
1115 


Met 


Tyr 


Pro 


Ala Tyr 
1130 


Pro 


He 


Lys 


Leu Asn 
1145 


Lys 


His 


Pro 


Gin Pro 
1160 


Glu 


Gly 


His 


Ser Ala 
1175 


Val 


Leu 


Gin 


He Leu 
1190 


Leu 


Ala 


He 


Pro Val 
1205 


Phe 


Ala 


Gly 


Gly Asp 
1220 


Ala 


t Ser 


Gly 


Arg Ala 
1235 


Gin 


Asn 


Phe 


Ser Lys 
1250 


lie 


Pro 


Gly 


Glu Lys 
1265 


Thr 


Thr 


Arg 


Thr He 
1280 


Met 


Gly 


Leu 


Val Leu 






1295 


He 


He 


Pro 


Cys Gly 
1310 


Glu 


Ala 


Leu 


He Ala 







820 


Ala 


Ala 


Gin Gly Glu 






835 


Pro 


Lys 


Ala Lys He 




850 


Lys 


Ala 


Gin Tyr Lys 




865 


Gin 


Pro 


Pro Leu Ser 






880 


Ser 


Glu 


Pro Ala Gly 






895 


Asp 


Lys 


Val Ala Ser 






910 


Glu 


Lys 


Ala Pro Lys 






925 


Leu 


Gin 


Leu Lys Ala 






940 


Lys 


Pro 


Val Ser Ala 






955 


Lys 


Glu 


Lys Glu Asn 






970 


Gin 


Asn 


Asp Gly Gin 






985 


Gly 


Leu 


Ser Ser Ser 






1000 


Gin 


Thr 


Arg Leu Gly 






1015 


Asp 


Trp 


Tyr Ser Gin 






1030 


His 


Asp 


He Ser Gly 






1045 


He 


Trp 


Glu Ala He 




1060 


Leu 


Gly 


Val Glu Asn 




1075 


Ala 


Leu 


Glu Lys Glu 






1090 


Val 


Ala 


Trp Val Thr 






1105 


He 


Ala 


He Arg Pro 






1120 


Ala 


Lys 


Trp Val Gin 






1135 


Gin 


Trp 


Cys Asn Val 






1150 


Phe 


Leu 


Arg Thr Arg 






1165 


Phe 


Ala 


Thr Met Glu 






1180 


Asp 


Leu 


Tyr Ala Gin 






1195 


Val 


Lys 


Gly Arg Lys 






1210 


Tyr 


Thr 


Thr Thr He 




1225 


He 


Gin 


Gly Gly Thr 






1240 


Met 


Phe 


Glu He Val 






1255 


Gin 


Phe 


Ala Tyr Gin 






1270 


Gly 


Val 


Met Thr Met 




1285 


Pro 


Pro 


Arg Val Ala 






1300 


He 


Thr 


Asn Ala Leu 






1315 


Lys 


Cys 


Asn Asp Tyr 



825 



Val 


Val 


Arg 


Lys Leu 
840 


Asn 


Glu 


Ala 


Val Glu 
855 


Glu 


Lys 


Thr 


Gly Lys 
870 


Gin 


Ser 


Ser 


Asp Ser 
885 


Leu 


Glu 


Thr 


Pro Glu 
900 


Gin 


Gly 


Glu 


Val Val 
915 


Asp 


Gin 


Val 


Asp He 
930 


Gin 


Tyr 


Lys 


Ser Leu 
945 


Thr 


Gly 


Ala 


Glu Asp 
960 


Lys 


Ser 


Glu 


Lys Gin 
975 


Arg 


Lys 


Asp 


Pro Ser 
990 


Gly 


Ala 


Gly 


Glu Gly 
1005 


Leu 


Glu 


Ala 


Lys Lys 
1020 


Val 


He 


Thr 


Lys Ser 
1035 


Cys 


Tyr 


He 


Leu Arg 
1050 


Lys 


Asp 


Phe 


Phe Asp 
1065 


Cys 


Tyr 


Phe 


Pro Met 
1080 


Lys 


Thr 


His 


Val Ala 






1095 


Arg 


Ser 


Gly 


Lys Thr 
1110 


Thr 


Ser 


Glu 


Thr Val 
1125 


Ser 


His 


Arg 


Asp Leu 
1140 


Val 


Arg 


Trp 


Glu Phe 
1155 


Glu 


Phe 


Leu 


Trp Gin 
1170 


Glu 


Ala 


Ala 


Glu Glu 
1185 


Val 


Tyr 


Glu 


Glu Leu 
1200 


Thr 


Glu 


Lys 


Glu Lys 
1215 


Glu 


Ala 


Phe 


He Ser 
1230 


Ser 


His 


His 


Leu Gly 
1245 


Phe 


Glu 


Asp 


Pro Lys 
1260 


Asn 


Ser 


Trp 


Gly Leu 
1275 


Val 


His 


Gly 


Asp Asn 
1290 


Cys 


Val 


Gin 


Val Val 






1305 


Ser 


Glu 


Glu 


Asp Lys 
1320 


Arg 


Arg 


Arg 


Leu Leu 
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1325 


Ser 


Val 


Asn 


He Arg 
1340 


Pro 


Gly 


Trp 


Lys Phe 
1355 


Arg 


Leu 


Glu 


Val Gly 
1370 


Ala 


Val 


Arg 


Arg Asp 
1385 


Glu 


Ala 


Glu 


Thr Lys 
1400 


Thr 


Leu 


Phe 


Thr Arg 
1415 


Val 


Ala 


Asn 


Thr Met 
1430 


Lys 


He 


Val 


Gin lie 
1445 


Trp 


He 


Lys 


Lys Thr 
1460 


Ala 


Pro 


Ser 


Met Gly 
1475 


Leu 


Cys 


Glu 


Leu Gin 
1490 


Pro 


Ala 


Lys 


Tyr Tyr 
1505 



<210> 49 
<211> 238 
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<220> 
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<223> Incyte ID No. 

<400> 49 



Met 


Ala 


Ala 


Arg 


Thr 


1 








5 


Gly 


Cys 


Arg 


Pro 


Lys 
20 


Val 


Arg 


Asn 


Gly 


Arg 
35 


Arg 


Thr 


Leu 


Pro 


Leu 
50 


Ala 


Arg 


Thr 


Phe 


Phe 
65 


Lys 


Glu 


Tyr 


Ser 


Glu 
80 


Met 


Tyr 


Asp 


Val 


Val 
95 


Pro 


Trp 


Cys 


Lys 


Lys 
110 


Cys 


Lys 


Thr 


Arg 


Leu 
125 


Tyr 


Thr 


Ser 


Val 


Val 
140 


Ser 


Cys 


Thr 


Asp 


Gly 
155 


Cys 


Phe 


Ser 


Pro 


Gly 
170 


Asp 


Phe 


Ser 


lie 


Ser 








185 


Leu 


Ala 


Thr 


Leu 


Phe 
200 









1330 


Val 


Arg 


Ala 


Asp Leu 
1345 


Asn 


His 


Trp 


Glu Leu 






1360 


Pro 


Arg 


Asp 


Met Lys 
1375 


Thr 


Gly 


Glu 


Lys Leu 
1390 


Leu 


Gin 


Ala 


lie Leu 
1405 


Ala 


Ser 


Glu 


Asp Leu 
1420 


Glu 


Asp 


Phe 


Gin Lys 
1435 


Pro 


Phe 


Cys 


Gly Glu 
1450 


Thr 


Ala 


Arg 


Asp Gin 
1465 


Ala 


Lys 


Ser 


Leu Cys 
1480 


Pro 


Gly 


Ala 


Lys Cys 
1495 


Thr 


Leu 


Phe 


Gly Arg 
1510 



2742442 



Gly 


His 


Thr 


Ala 


Leu 








10 


Ser 


Ala 


Thr 


Ala 


Ala 










25 


Tyr 


Leu 


Ala 


Ser 


Cys 










40 


His 


Thr 


Ser 


He 


Leu 










55 


Lys 


lie 


Thr 


Ala 


Pro 










70 


Arg 


Arg 


He 


Leu 


Gly 










85 


Ser 


Gly 


Val 


Glu 


Asp 










100 


Ser 


Asp 


Val 


He 


Ser 








115 


Glu 


lie 


Gly 


Phe 


Pro 










130 


Thr 


Leu 


Val 


Lys 


Pro 








145 


Arg 


Leu 


Phe 


Asn 


His 










160 


Leu 


Pro 


Gly 


Tyr 


Pro 










175 


Phe 


Glu 


Phe 


Arg 


ser 










190 


Phe 


Asp 


Glu 


Val 


Val 










205 



1335 



Arg 


Asp 


Asn 


Tyr Ser 








1350 


Lys 


Gly 


Val 


Pro lie 








1365 


Ser 


Cys 


Gin 


Phe Val 






1380 


Thr 


Val 


Ala 


Glu Asn 








1395 


Glu 


Asp 


He 


Gin Val 








1410 


Lys 


Thr 


His 


Met Val 






1425 


lie 


Leu 


Asp 


Ser Gly 








1440 


He 


Asp 


Cys 


Glu Asp 








1455 


Asp 


Leu 


Glu 


Pro Gly 








1470 


lie 


Pro 


Phe 


Lys Pro 








1485 


Val 


Cys 


Gly 


Lys Asn 








1500 


Ser 


Tyr 







Arg 


Arg 


Val 


Val 


Ser 
15 


Gly 


Ala 


Gin 


Ala 


Pro 
30 


Gly 


He 


Leu 


Met 


Ser 
45 


Pro 


Lys 


Glu 


lie 


Cys 
60 


Leu 


lie 


Asn 


Lys 


Arg 
75 


Tyr 


Ser 


Met 


Gin 


Glu 
90 


Tyr 


Lys 


His 


Phe 


Val 
105 


Lys 


Arg 


Ser 


Gly 


Tyr 
120 


Pro 


Val 


Leu 


Glu 


Arg 
135 


His 


Leu 


Val 


Lys 


Ala 
150 


Leu 


Glu 


Thr 


lie 


Trp 
165 


Arg 


Thr 


Cys 


Thr 


Leu 
180 


Leu 


Leu 


His 


Ser 


Gin 
195 


Lys 


Gin 


Met 


Val 


Ala 



210 
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Ala Phe Glu Arg Arg 
215 

He Pro Arg Glu Leu 
230 



Ala Cys Lys Leu Tyr 
220 

Met Leu His Glu Val 
235 



Gly Pro Glu Thr Asn 
225 

His His Thr 



<210> 50 
<211> 653 
<212> PRT 
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<220> 

<221> misc_feature 
<223> Incyte ID No.: 

<400> 50 



Met 


Pro 


Phe 


Ser 


Ala 


1 








5 


Arg 


Tyr 


Ala 


Pro 


Pro 
20 


Thr 


Ser 


Ser 


Leu 


Glu 
35 


Met 


Ser 


Glu 


Ser 


Lys 
50 


Gly 


Thr 


Pro 


Leu 


Asn 
65 


Gly 


Pro 


Gly 


Ser 


Leu 
80 


Ser 


Gin 


Pro 


Arg 


Asn 
95 


Thr 


Lys 


Pro 


Ser 


Ser 








110 


Thr 


His 


Glu 


Leu 


Arg 
125 


Gly 


Pro 


Lys 


Leu 


Met 
140 


Lys 


Cys 


Lys 


Lys 


Asp 
155 


Asp 


Lys 


Ser 


Trp 


Lys 
170 


Tyr 


Glu 


Gly 


Pro 


Tyr 
185 


Glu 


Cys 


Arg 


Tyr 


Ser 
200 


Glu 


He 


Asp 


Val 


Trp 
215 


Glu 


Ala 


Phe 


Phe 


Gly 
230 


Lys 


Leu 


Leu 


Gin 


Glu 
245 


Lys 


Cys 


Phe 


Asp 


His 
260 


Asp 


Asn 


Ser 


Thr 


Ala 
275 


Glu 


Asp 


Asn 


Lys 


Cys 
290 


Lys 


Tyr 


Ser 


Lys 


He 
305 


Leu 


Cys 


Arg 


His 


Glu 
320 


Cys 


Phe 


Tyr 


Ala 


His 
335 


Gin 


Asn 


Glu 


Thr 


Gly 
350 


Lys 


Arg 


Tyr 


Trp 


Gin 
365 


Val 


Leu 


Gly 


Asn 


Gin 



3511087 



Ser 


Leu 


Leu 


Gly Thr 










10 


Pro 


Ser 


Phe 


Ser 


Glu 










25 


Asp 


Phe 


Cys 


Ser 


Ser 










40 


Arg 


Asp 


Leu 


Ser 


Thr 










55 


Asn 


Ser 


Asn 


Ser 


Ser 










70 


Phe 


Ala 


Ser 


Glu 


Asn 










85 


Asp 


Phe 


Gly 


Asn 


Phe 










100 


Ser 


Val 


Thr 


Pro 


Arg 










115 


Gin 


Ala 


Cys 


Gin 


He 








13 0 


Asp 


Phe 


Thr 


Tyr 


His 










145 


He 


Leu 


He 


Gly Arg 










160 


Lys 


He 


Arg 


Pro 


Arg 










175 


Tyr 


He 


Cys 


Lys 


Asp 










190 


Gly 


His 


Cys 


Thr 


Phe 










205 


Thr 


Leu 


Glu 


Arg 


Lys 










220 


Gly 


Asn 


Gly 


Lys 


He 










235 


His 


Leu 


Gly 


Glu 


Phe 








250 


Lys 


Pro 


Arg 


Met 


He 










265 


Cys 


Ser 


His 


Pro 


Val 










280 


Leu 


Val 


His 


He 


Leu 










295 


Arg 


Ser 


Phe 


His 


Gly 










310 


Val 


Arg 


Tyr 


Gly Cys 










325 


Ser 


Leu 


Val 


Glu 


Leu 










340 


He 


Ser 


His 


Asp 


Ala 








355 


Asn 


Leu 


Glu 


Ala 


Asn 










370 


He 


Met 


Pro 


Gly 


Phe 



Leu 


Pro 


He Gly 


Ala 










15 


Phe 


Tyr 


Pro 


Pro 


Leu 










30 


Leu 


Asn 


Ser 


Phe 


Ser 










45 


Ser 


Thr 


Ser 


Arg 


Glu 










60 


Leu 


Leu 


Leu 


Met 


Asn 










75 


Phe 


Leu Gly 


He 


Ser 










90 


Phe 


Gly 


Ser 


Ala 


Val 








105 


His 


Pro 


Leu 


Glu 


Gly 










120 


Cys 


Phe 


Val 


Lys 


Ser 










135 


Ala 


Asn 


He 


Asp 


His 










150 


He 


Lys 


Asn 


Val 


Glu 








165 


Pro 


Thr 


Lys 


Thr 


Asn 








180 


Val 


Ala 


Ala 


Glu 


Glu 










195 


Aia 


Tyr 


Cys 


Gin 












210 


Gly 


Ala 


Phe 


Ser 


Arg 










225 


Asn 


Leu 


Thr 


Val 


Phe 










240 


He 


Phe 


Leu 


Cys 


Glu 










255 


Ser 


Lys 


Arg Asn 


Lys 










270 


Thr 


Lys 


His 


Glu 


Phe 










285 


Arg 


Glu 


Thr 


Thr 


Val 










300 


Gin 


Cys 


Gin 


Leu 


Asp 










315 


Leu 


Arg Glu Asp 


Glu 










330 


Lys 


Val 


Trp 


He 


Met 










345 


He 


Ala 


Gin 


Glu 


Ser 










360 


Val 


Pro Gly Ala 


Gin 










375 


Leu 


Asn 


Met 


Lys 


He 
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380 



Lys 


Phe 


Val 


Cys 


Ala 
395 


Gin 


Cys 


Leu 


Arg 


Pro 


Asp 


Lys 


Asn 


Arg 
410 


Lys 


Tyr 


Cys 


Ser 


Trp 


Thr 


Lys 


Asp 


Arg 
425 


Arg 


Ala 


Met 


Arg 


Lys 


Lys 


Trp 


Met 


Asn 
440 


He. 


Arg 


Pro 


Leu 


Pro 


Leu 


Gin 


Phe 


Asp 
455 


Leu 


Cys 


Asn 


His 


Cys 


Gin 


Tyr 


Val 


Gly 
470 


Asn 


Cys 


Ser 


Phe 


Arg 


Glu 


Val 


Trp 


Thr 
485 


Tyr 


Met 


Lys 


Glu 


Glu 


Gin 


Phe 


Tyr 


Glu 
500 


Leu 


Trp 


Leu 


Lys 


Ser 


Glu 


Asp 


He 


Ala 


Ser 


Gin 


Ser 


Asn 








515 










He 


His 


Met 


Pro 


Thr 

530 


Asp 


Tyr 


Ala 


Glu 


Cys 


Trp 


Met 


Cys 


Gly 
545 


Lys 


Asn 


Cys 


Asn 


Gly 


His 


He 


Ser 


Ser 
560 


Glu 


Lys 


His 


Lys 


Glu 


Asp 


Asp 


Gin 


Tyr 
575 


Cys 


Trp 


Gin 


His 


Phe 


Ser 


He 


Cys 


Asp 
590 


Arg 


Tyr 


Met 


Asn 


Asn 


Ser 


Cys 


Lys 


Phe 


Ala 


His 


Gly Asn 










605 










Glu 


Glu 


Arg Arg 


Asp 


Ala 


Leu 


Lys 


Met 










620 










Lys 


Asp 


His 


Leu 


He 
635 


Gly 


Pro 


Asn 


Asp 


Ser 


Phe 


Leu 


Phe 


Lys 


Asp 


Leu 


Asn 





650 
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Met 


Gin 


Asp 


Thr 


Gly 


Ser 


Val 


Val 


Pro 


1 








5 










Gly 


Tyr 


Ala 


Ala 


Leu 


Val 


Ala 


Ser Gly 










20 










Lys 


Ala 


Gly 


Ser 


Val 
35 


Pro 


Ser 


Leu 


Ala 


Ser 


Leu 


Ala 


Gly 


Leu 


Gly Ala 


Tyr 


Gin 










50 
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Ala 


Tyr 


Arg 


His 


Leu 
20 


Phe 


Cys 


Pro 


Asp 


Ala 


Phe 


He 


Thr 


Gly 


Gly 


Gly 


Ser Gly 










35 










Glu 


He 


Phe 


Met 


Arg 
50 


His 


Gly 


Cys 


His 


Ser 


Leu 


Pro 


Arg 


Val 
65 


Leu 


Thr 


Ala 


Ala 


Thr 


Gly 


Arg 


Arg 


Cys 
80 


Leu 


Pro 


Leu 


Ser 


Pro 


Ala 


Val 


Met 


Ala 
95 


Ala 


Val 


Asp 


Gin 


Arg 


He 


Asp 


He 


Leu 
110 


He 


Asn 


Cys 


Ala 


Pro 


Ala 


Gly 


Ala 


Leu 
125 


Ser 


Phe 


Asn 


Ala 


He 


Asp 


Thr 


Ser 


Gly 


Thr 


Phe 


Asn 


Val 



2 
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Trp 


Glu Ala Gly Arg 


He 


10 




15 


Arg 


Gly Arg Gly Gly 


Ser 


25 




30 


Val 


Pro Asp Ser Ala 


His 


40 




45 


Ser 


Gly Cys Cys Gly 


Leu 


55 




60 


Met 


Glu Glu Gly Gly 


Asn 


70 




75 


Leu 


Leu Val Leu Ser 


Gly 


85 




90 


Phe 


Val Ser Gly Phe 


Leu 


100 




105 


Phe 


Gly Leu Val Gin 


Ser 


115 


120 


Ser 


Met Gly Cys Ala 


Phe 


130 




13 5 


His 


Ala Trp Ala Gin 


Leu 


145 




150 


Leu 


Leu Phe Leu Ser 


Leu 


160 




165 


Leu 


Glu Pro Arg Thr 


Thr 


175 




180 


Glu 


Lys Glu Arg Gly 


Leu 


190 




195 


Gly 


Ser Asp Pro Tyr 


Arg 


205 




210 


Gly 


Asp Asp Cys Leu 


Pro 


10 




15 


Leu 


Leu Arg Asp Lys 


Val 


25 




30 


He 


Gly Phe Arg He 


Ala 


40 




45 


Thr 


Val He Ala Ser 


Arg 


55 




60 


Arg 


Lys Leu Ala Gly 


Ala 


70 




75 


Met 


Asp Val Arg Ala 


Pro 


85 




90 


Ala 


Leu Lys Glu Phe 


Gly 


100 




105 


Ala 


Gly Asn Phe Leu 


Cys 


115 




120 


Phe 


Lys Thr Val Met 


Asp 


130 




135 


Ser 


Arg Val Leu Tyr 


Glu 
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140 145 150 

Lys Phe Phe Arg Asp His Gly Gly Val He Val Asn He Thr Ala 

155 160 165 

Thr Leu Gly Asn Arg Gly Gin Ala Leu Gin Val His Ala Gly Ser 

170 175 180 

Ala Lys Ala Ala Val Asp Ala Met Thr Arg His Leu Ala Val Glu 

185 190 195 

Trp Gly Pro Gin Asn He Arg Val. Asn Ser Leu Ala Pro Gly Pro 

200 205 210 

He Ser Gly Thr Glu Gly Leu Arg Arg Leu Gly Gly Pro Gin Ala 

215 220 225 

ser Leu Ser Thr Lys Val Thr Ala Ser Pro Leu Gin Arg Leu Gly 

230 235 240 

Asn Lys Thr Glu He Ala His Ser Val Leu Tyr Leu Ala Ser Pro 

245 250 255 

Leu Ala Ser Tyr Val Thr Gly Ala Val Leu Val Ala Asp Gly Gly 

260 265 270 

Ala Trp Leu Thr Phe Pro Asn Gly Val Lys Gly Leu Pro Asp Phe 

275 280 285 

Ala Ser Phe Ser Ala Lys Leu 

290 



<210> 59 
<211> 158 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No.: 

<400> 59 



Glu 


Gly 


Gly Pro 


Ser 


1 






5 


Lys 


Pro 


Asp Gly 


Val 








20 


Arg 


Phe 


Glu Arg 


Arg 








35 


Gin 


Ala 


Pro Glu 


Ser 








50 


Arg 


Lys 


Pro Phe 


Tyr 








65 


Pro 


Val 


Val Ala 


Met 








80 


Ser 


Arg 


Ala Met 


He 








95 


Gly 


Thr 


He Arg 


Gly 








110 


He 


His 


Ala Ser 


Asp 








125 


Leu 


Trp 


Phe Gin 


Ser 








140 


Gin 


His 


Ser Ser 


He 








155 



964996 



Trp 


Thr Arg Glu 


Arg 






10 


Gin 


Arg Arg Leu 


Val 






25 


Gly 


Phe Thr Leu 


Val 




40 


Val 


Leu Ala Glu 


His 






55 


Pro 


Ala Leu He 


Arg 






70 


Val 


Trp Glu Gly 


Tyr 






85 


Gly 


His Thr Asp 


Ser 






100 


Tyr 


Phe Ser Val 


His 






115 


Ser 


Val Glu Gly 


Ala 






130 


Ser 


Glu Leu Val 


Ser 






145 


His 


Pro Ala 





Thr Leu Val Ala Val 
15 

Gly Asp Val He Gin 
30 

Gly Met Lys Met Leu 
45 

Tyr Gin Asp Leu Arg 
60 

Tyr Met Ser Ser Gly 
75 

Asn Val Val Arg Ala 
90 

Ala Glu Ala Ala Pro 
105 

He Ser Arg Asn Val 
120 

Gin Arg Glu He Gin 
135 

Trp Ala Asp Gly Gly 
150 



<210> 60 
<211> 559 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701884305H1 



30/52 



WO 01/36684 



PCT/US00/31743 



<400> 60 

ggaaacctaa acgcgcgtgc gcttcttcca cgccacggaa accgtgcagg cctggtgtgg 60 

tctccaaagt gactgaacaa tgcagaagga cagtggccca ctggttcctt tacattatta 120 

tggtttcggc tatgcggccc tggtggctac tggtgggatt attggctatg caaaagcagg 180 

tagtgtgccg tccctggctg ctggactctt ctttgggggc ctggcaggcc tgggtgccta 240 

ccagctgtct caggacccca ggaacgtgtg ggttttccta gctacgtctg ggactttggc 3 00 

tggcattatg gggatgagat tctacaactc tgggaaattt atgcctgcag gtttgatcgc 360 

gggagccagt ttgctgatgg ttgccaaact tggacttagt atgttgagtt caccccatcc 420 

gtagtagcca tagtcctgcg tgggctcatg atgagttgac actctccagt cctccacatt 480 

accacgctga agagataaga acagcaaaga cctacactga gcacatggag gcgaagacgt 540 

ggttactata gtgaccgtc 559 



<210> 61 
<211> 326 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701607951H1 



<400> 61 



gtgttgggtg tgttcttact ttgcggattt taccaccctg gaattgttcc gtacgcgcag 60 

gcgcgcgggc gctctcccgt gcactctctg ctgagctagc ggactgcccg cctctctaaa 120 

acgtcctgta actgcggttc cgggagtgga aacctaaacg cgcgtgcgct tcttccacgc 180 

cacggaaacc gtgcaggcct ggtgtggtct ccaaagtgac tgaacaatgc agaaggacag 240 

tggcccactg gttcctttac attattatgg tttcggctat gcggccctgg tggctactgg 3 00 

tgggattatt ggctatgcaa aagcag 32 6 



<210> 62 
<211> 333 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701644253H1 

<400> 62 

aacgtcctgt aactgcggtt ccgggagtgg aaacctaaac gcgcgtgcgc tttcttccac 60 

gccacggaaa accgtgcagg cctngtgtgg tctccanagt gactgaacaa tgcagaagga 120 

cagtggccca ctggntcctt tacattatta tggtttcggc tatgcggccc tggtggctac 180 

tggtgggatt attggctatg caaaagcagg tagtgtgccg tccctggctg ctggactctt 240 

ctttgggggc ctggcaggcc tgggtgccta ccagctgtct caggacccca ggaacgtgtg 3 00 

ggttttccta gctacgnctg ggactttggc tgg 333 



<210> 63 
<211> 318 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701513151H1 

<400> 63 

cttactttgc ggattttacc accctggaat 
cccgtgcact ctctgctgag ctagcggact 
ggttccggga gtggaaacct aaacgcgcgt 
ggcctggtgt ggtctccaaa gtgactgaac 
tttacattat tatggtttcg gctatgcggc 



tgttccgtac gcgcangngc gcggggctct 60 
gcccgcctct ctaaaacgtc ctgtaactgc 120 
gcgcttcttc cacgccacgg aaaccgtgca 180 
aatgcagaag gacagtggcc cactggttcc 240 
cctggtggct actggtggga ttattggcta 300 
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<210> 64 
<211> 315 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701652337H1 

<400> 64 

cagcncaggc ctccgggctc cagctccggt 

gtgactgaac aatgcagaag gacagtggcc 

gctatgcggc cctggtggct actggtggga 

cgtccctggc tgctggactc ttctttgggg 

ctcaggaccc caggaacgtg tgggttttcc 

ggggatgaga ttcta 



gttgggtnca ggcctggtgt ggtctccaaa 60 
cactggttcc tttacattat tatggtttcg 120 
ttattggcta tgcaaaagca ggtagtgtgc 180 
gcctggcagg cctgggtgcc taccagctgt 240 
tagctacgtc tgggactttg gctggcatat 300 

315 



<210> 65 
<211> 313 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701562183H1 

<400> 65 

ggtctccaaa gtgactgaac aatgcagaag 
tatggtttcg gctatgcggc cctggtggct 
ggtagtgtgc cgtccctggc tgctggactc 
taccagctgt ctcaggaccc caggaacgtg 
gctggcatta tggggatgag attctacaac 
gcgggancat ttt 



gacagtggcc cactggttcc tttacattat 60 
actggtggga ttattggcta tgcaaaagca 120 
ttctttgggg gcctggcagg cctgggtgcc 180 
tgggttttcc tagctacgtc tgggactttg 240 
tctgggaaat ttatgcctgc aggtttgatc 3 00 

313 



<210> 66 
<211> 304 
<212> DNA 

<213> Rattus norvegicus 

<220> misc_feature 

<223> Incyte ID No.: 700227356H1 

<400> 66 

cgccgtcgtc ctccagcgca ggcctccggg 
tgtggtctcc aaagtgactg aacaatgcag 
tattatggtt tcggctatgc ggccctggtg 
gcaggtagtg tgccgtccct ggctgctgga 
gcctaccagc tgtctcagga ccccaggaac 
ttgg 



ctccagctcc ggtgttgggt gcaggcctgg 60 
aaggacagtg gcccactggt tcctttacat 120 
gctactggtg ggattattgg ctatgcaaaa 180 
ctcttctttg ggggcctggc aggcctgggt 240 
gtgtgggttt tcctagctac gtctgggact 3 00 

304 



<210> 67 
<211> 327 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701649802H1 
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<400> 67 

ctccggtgtt gggtgcaggc 
agtggaccac tggttcctta 
gtgggattat tgnctttgca 
ttgggggcct ggcaggcctg 
ttttcctagc tacgtctggg 
ggaaatttat gcctgcagtt 



ctggtgtggt ctccaaagtg 
cattattatg gtttcggcta 
aaagcaggta gtgtgccgtc 
ggtgcctacc agctgtctca 
actttggctg gcattatggg 
tgatcgc 



actgaacaat gcagaaggac 60 
tgcggccctg gtggctactg 120 
cctggctgtt ggactcttct 180 
ggaccccagg aacgtgtggg 240 
gatgagattc tacaactctg 300 

. 327 



<210> 68 
<211> 305 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700226414H1 

<400> 68 

tccagctccg gtgttgggtg caggcctggt 60 

aggacagtgg cccactggtt cctttacatt 120 

ctactggtgg gattattggc tatgcaaaag 180 

tcttctttgg gggcctggca ggcctgggtg 240 

gtgggttttc ctagctacgt ctgggacttg 3 00 

305 



gccgtcgtcc 
gtggtctcca 
attatggttt 
caggtagtgt 
cctaccagct 
gctgg 



tccagcncag 
aagtgactga 
cggctatgcg 
gccgtccctg 
gtctcaggac 



gcctccgggc 
acaatgcaga 
gccctggtgg 
gctgctggac 
cccaggaagt 



<210> 69 
<211> 295 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700275094H1 

<400> 69 

tcctccagcn caggcntccg ggctccagct 
ccaaagtgac tgaacaatgc agaaggacag 
tttcggctat gcggccctgg tggctactgg 
tgtgccgtcc ctggctgctg gactcttctt 
agctgtctca ggaccccagg aacgtgtggg 



ccggtgttgg gtgcaggcct ggtgtggtct 60 
tggcccactg gttcctttac attattatgg 120 
tgggattatt ggctatgcaa aagcaggtag 180 
tggggggcct ggcaggcctg ggtgcctacc 240 
ttttcctagc tacgtctggg atttg 295 



<210> 70 
<211> 301 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700226425H1 



<400> 70 



cctgacctct gttcctgtgc tcccgccgtc gtcctccagc gcaggcctcc gggctccagc 60 
tccggtgttg ggtgcaggcc tggtgtggtc tccaaagtga ctgaacaatg cagaaggaca 120 
gtggcccact ggttccttta cattattatg gtttcggcta tgcggccctg gtggctactg 180 
gtgggattat tggctatgca aaagcaggta gtgtgccgtc cctggctgct ggactcttct 24 0 
ttgggggcct ggcaggcctg ggtgcctacc agctgtctca ggaccccagg aacgtgtggg 300 
t 301 



<210> 71 
<211> 282 



33/52 



WO 01/36684 



PCTYUS00/31743 



<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700275207H1 



<400> 71 



tcctccagcg caggcctccg ggctccagct 

ccaaagtgac tgaacaatgc agaaggacag 

tttcggctat gcggccctgg tggctactgg 

tgtgccgtcc ctggctgctg gactcttctt 

gctgtctcag gaccccagga acgtgtgggt 



ccggtgttgg gtgcaggcct ggtgtggtct 60 
tggcccactg gttcctttac attattatgg 120 
tgggattatt ggctatgcaa aagcaggtag 180 
tgggggcctg gcaggcctgg gtgcctacca 240 
tttcctagct ac 282 



<210> 72 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701507568H1 

<400> 72 

cgccgtcgtc ctccagcgca ggcctccggg 
tgtggtctcc aaagtgactg aacaatgcag 
tattatggtt tcggctatgc ggccctggtg 
gcaggtagtg tgccgtccct ggctgctgga 
gcctaccagc tgtctcagga ccccaggaac 



ctccagctcc ggtgttgggt gcaggcctgg 60 

aaggacagtg gcccactggt tcctttacat 120 

gctactggtg ggattattgg ctatgcaaaa 180 

ctcttctttg ggggcctggc aggcctgggt 240 

gtgtgggttt tc 282 



<210> 73 
<211> 281 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 700300118H1 



<400> 73 



cgccgtcgtc ctccagcgca ggcctccggg 
tgtggtctcc aaagtgactg aacaatgcag 
tattatggtt tcggctatgc ggccctggtg 
gcaggtagtg tgccgtccct ggctgctgga 
gcctaccagc tgtctcagga ccccaggaac 



ctccagctcc ggtgttgggt gcaggcctgg 60 
aaggacagtg gcccactggt tcctttacat 120 
gctactggtg ggattattgg ctatgcaaaa 180 
ctcttctttg ggggcctggc aggcctgggt 240 
gtgtgggttt t 281 



<210> 74 
<211> 292 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700301710H1 



<400> 74 



cctgnacctc tgttcctgtg ctcccgccgt 

ctccggtgtt gggtgcaggc ctggtgtggt 

agtggcccac tggttccttt acattattat 

ggtgggatta ttggctatgc aaaagcaggt 

tttgggggcc tggcaggcct gggtgcctac 



cgtcctccag cgcaggcctc cgggctccag 60 
ctccaaagtg actgaacaat gcagaaggac 120 
ggtttcggct atgcggccct ggtggctact 180 
agtgtgccgt ccctggctgc tggactcttc 240 
cagctgtctc aggaccccag ga 292 
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<210> 75 
<211> 289 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID Ko.: 700064344H1 

<400> 75 

cagcgcaggc ctccgggctc cagctccggt 
ccaccctgga attgttccgt acgcgcaggc 
gagctagcgg actgcccgcc tctctaaaac 
cctaaacgcg cgtgcgcttc ttccacgcca 
aaagtgatga acatgcagaa ggacantggc 



gttgggtgtg ttcttacttt gcggatttta 60 
gcgcgggcgc tctcccgtgc actctctgct 120 
gtcctgtaac tgcggttccg ggagtggaaa 180 
cggaaaccgt gcaggcctgg tgtggtctcc 240 
ccactggttc ttanatatt 289 



<210> 76 
<211> 276 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701423273H1 

<400> 76 

agcgcaggcc tcagggctcc agctccggtg 
tgactgaaca atgcagaagg acagtggccc 
ctatgcggcc ctggtggcta ctggtgggat 
gtccctggct gctggactct tctttggggg 
tcaggacccc aggaacgtgt gggttttcct 



ttgggtgcag gcctggtgtn gtctccaaag 60 
actggttcct ttacattatt atggtttcgg 120 
tattggctat gcaaaagcag gtagtgtgcc 180 
cctggcaggc ctgggtgcct accagctgtc 240 
agctac 276 



<210> 77 
<211> 293 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700225847H1 

<400> 77 

ccgtcgtcct ccagcncagg cctccgggct 
tggtctccaa agtgactgaa caatgcagaa 
ttatggtttc ggctatgcgg ccctggtggc 
aggtagtgtg ccgtccctgg ctgctggact 
taccagctgt ctcaggaccc cagaacgtgt 



ccagctccgg tgttgggtgc aggcctggtg 60 
ggacagtggc ccactggttc ctttacatta 120 
tactggtggg attattggct atgcaaaagc 180 
ctctttgggg gcctggcang cctgggtgcc 240 
gggtttccta gctacgtctg gga 293 



<210> 78 
<211> 274 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701462776H1 

<400> 78 

tgctcccgcc gtcgtcctcc agcgcaggcc tccgggctcc agctccggtg ttgggtgcag 60 
gcctggtgtg gtctccaaag tgactgaaca atgcagaagg acagtggcnc actggttcct 120 
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ttacattatt atggtttcgg ctatgcggcc ctggtggcta ctggtgggat tattggctat 180 
gcaaaagcag gtagtgtgcc gtccctggct gctggactct tctttggggg cctggcaggc 240 
ctgggtgcct accagctgtc tcaggacccc agga 274 



<210> 79 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No. : 700916803H1 

<400> 79 

gtgctcccgc cgtcgtcctc cagcgcaggc ctccgggctc cagctnccgg tgttgggtgt 60 

gttcttactt tgcggatttt accaccctgg aattgttccg tacgcgcagg cgcgcggggc 12 0 

tctcccgtgc actctctgct gagctagcgg actgcccgcc tctctaaaac gtcctgtaac 180 

tgcggttccg ggagtggaaa cctaaacgcg cgtgcgcttc ttccacgcca cggaaaccgt 240 

gcaggcctgg tgtggtctcc aaagtgactg aacaatgcag aa 282 



<210> 80 
<211> 280 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700478141H1 

<400> 80 

gccgtcgtcc tccagcgcag gcctccgggc tccagctccg gtgttgggtg caggcctggt 60 

gtggtctcca aagtgatgaa caatgcagaa ggacagtggc ccactggttc ctttacatta 120 

ttatggtttc ggctatgcgg ccctggtggc tactggtggg attattggct atgcaaaagc 180 

aggtagtgtg ccgtccctgg ctgctggact cttctttggg ggcctggcag gcctgggtgc 240 

ctaccagctg tctcaggacc ccaggaacgt gtgggttttc 280 



<210> 81 
<211> 299 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> mis cofeature 

<223> Incyte ID No.: 701646690H1 

<400> 81 

tncctccngg ctccagctcc ggtgttgggt gcaggcctgg tgtggtctcc aaagtnactg 60 

aacaatgcan aangacagtn gcccactggt tcctttacnt tattatggtt tcngntatgc 120 

ngccctggtg gctactggtg ggattattgg ctatgcaaaa ncaggtagtg tgccgtccct 180 

ggctgntgga ntcttctttg ggggcctggc aggcctgggt gcctaccagc tgtctcagga 240 

ccccaggaac gtgtgggttt tcctagctac gtctggnact ttggctggca tatggggat 299 



<210> 82 
<211> 286 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701624261H1 
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<400> 82 



tctcctccac aggtgcangc ctggtgtggt 
agtggcccac tggttccttt acattattat 
ggtgggatta ttggctatgc aaaagcaggt 
tttgggggcc tggcaggcct gggtgcctac 
gttttcctag ctacgtctgg gactttggct 



ctccaaagtg actgnncaat gcagaaggac 60 

ggtttcggct atgcggccct ggtggctact 120 

agtgtgccgt ccctggctgc nngactcttc 180 

cagctgtctc aggaccccag gaacgtgtgg 240 

ggcattatgg ggatga 286 



<210> 83 

<211> 266 

<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700912920H1 



<400> 83 



gcagaaggac agtggcccac tggttccttt 
ggtggctact ggtgggatta ttggctatgc 
tggactcttc tttgggggcc tggcaggcct 
gaacgtgtgg gttttcctag ctacgtctgg 
ctacaactct gggaaattta tgcctg 



acattattat ggtttcggct atgcggccct 60 
aaaagcaggt agtgtgccgt ccctggctgc 120 
gggtgcctac cagctgtctc aggaccccag 180 
gactttggct ggcattatgg ggatgagatt 240 

266 



<210> 84 
<211> 262 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 701482566H1 



<400> 84 



ctggctgctg gactcttctt tgggggcctg 
gaccccagga acgtgtgggt tttcctagct 
atgagattct acaactctgg gaaatttatg 
ctgatggttg ccaaacttgg acttagtatg 
ccctgcgtgg gctcatgatg ag 



gcaggcctgg gtgcctacca gctgtctcag 60 
acgtctggga ctttggctgg cattatgggg 120 
cctgcaggtt tgatcgcggg agccagtttg 180 
ttgagttcac cccatccgta gtagccatag 240 

262 



<210> 85 
<211> 285 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 700270272H1 



<400> 85 



ctgttcctgt gctcccgccg tcgtcctcca 
tgggtgcagg cntgntgtgg tctccaaagt 
ctggttcctt tacattatta tggtttcggc 
attggctatg caaaagcagg tagtgtgccg 
cctggcaggc ctgggtgcct accagctgtc 



gcncaggcct ccgggctcca gctccggtgt 60 
gactgaacaa tgcagaagga cagtggccca 120 
tatgcggccc tggtggctac tggtgggatt 180 
tccctggcct gctggactct tctttggggg 240 
tcaggacccc aggaa 285 



<210> 86 
<211> 268 
<212> DNA 
<213> Rattus 



norvegicus 
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<220> 

<221> misc_feature 
<223> Incyte ID No. 

<400> 86 



700628520H1 



ctccagcnca ggcctccggg ctccagctcc ggtgttgggt gcaggcctgg tgtggtctcc 60 

aaagtgactg aacaatgcag aaggacagtg gcccactggt tcctttacat tattatggtt 120 

tcggctatgc ggccctggtg gctactggtg ggattattgg ctatgcaaaa gcaggtagtg 180 

tgccgtccct ggctgctgga ctcttctttg ggggcctggc aggcctgggt gcctaccagc 240 

tgtctcagga ccccaggaac gtgtgggt 268 



<210> 87 
<211> 269 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_feature 
<223> Incyte ID No. 

<400> 87 



700534975H1 



tgctcccgcc gtcgtcctcc agcgcaggcc tccgggctcc agctccggtg ttgggtgcag 60 

gcctggtgtg gtctccaaag tgactgaaca atgcagaagg acagtggctc actggttcct 120 

ttacattant atggtttcgg ctatgcggcc ctggtggcta ctggtgggat tattggctat 180 

gcaaaagcag gtagtgtgcc gtccctggct gctggactct tctttggggg cctggcaggc 240 

ctgggtgcct accagctgtc tcaggaccc 269 



<210> 88 
<211> 262 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_f eature 
<223> Incyte ID No. 

<400> 88 



700176004H1 



tatgcngccc tggtggctac tggtgggatt attggctatg canaagcagg tagtgtgccg 60 
tccctggctg ctggactctt ctttgggggc ctggcaggcc tgggtgccta ccagctgtct 120 
caggacccca ggaacgtgtg ggttttccta gctacgtctg ggactttggc tggcattatg 180 
gggatgagat tctacaactc tgggaaattt atgcctgcag gtttgatcgc gggagccagt 240 
ttgctgatgg ttgccaaact tg 2 62 



<210> 89 
<211> 349 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_f eature 
<223> Incyte ID No. 



701609236H1 



<400> 89 

cgtacgcgca 
gcctctctaa 
ttcttccacg 
cagaaggaca 
gtggctactg 
ctctcttggg 



ggcgcgcggg 
aacgtcctgt 
ccacggaaac 
gtggcccact 
gtgggatatt 
ggctngcagc 



gctctcccgt 
aactgcggtt 
cgtgcaggcc 
ggttccttta 
ggctatgcaa 
ctggtgctaa 



gcactctctg 
ccgggagtgg 
tggtgtggtc 
cattattatg 
aagcagtatg 
caactgtctc 



gctgagcnng 
aaacctaaac 
tccaaagtga 
gtttcggcta 
tgccgtccct 
agancccag 



cggactgccc 60 
gcgcgtgcgc 120 
ctgaacaatg 180 
tgcggccctg 240 
ggctgctgga 3 00 
349 
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<210> 90 
<211> 263 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 70.1473437H1 

<400> 90 

agcncaggcc tccgggctcc agctccggtg 
tgactgaaca atgcagaagg acagtggccc 
ctatgcggcc ctggtggcta ctggtgggat 
gtccctggct gctggactct tctttggggg 
tcaggacccc aggaacgtgt ggg 



ttgggtgcag gcctggtgng gtctccaaag 60 
actggttcct ttacattatt atggtttcgg 120 
tattggctat gcaaaagcag gtagtgtgcc 180 
cctggcaggc ctgggtgcct accagctgtc 240 

263 



<210> 91 
<211> 303 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc„f eature 

<223> Incyte ID No.: 701606089H1 

<400> 91 

gcgcaggcct ccggggctcc agctccggtg 
tgactgaaca atgcagaagg acgttngccc 
ctatgcggcc ctggtggcta ctggtgggan 
gtccctngct gctggactct tctttngggg 
tcangacccc aggaacgtgt gggttttccn 
gga 



ttgggtgcag gcctggtgtg gtctccaaag 60 
actggntcct ttacattatt atggtttcgg 120 
tattggctat gcaaaagcag gtagtgtgcc 180 
cctgncangc ctgggtgcct accagctgtc 240 
agctacgtct gggatttgnc tggcatatng 300 

303 



<210> 92 
<211> 273 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701736525H2 

<400> 92 

taactgctcc gacctctcct ccacaggtgc 

caatgcagaa ggacagtggc ccactggttc 

ccctggtggc tactggtggg attattggct 

ctgctggact cttctttggg ggcctggcag 

ccaggaacgt gtgggttttc ctagctacgt 



aggcctggtg tggtctccaa agtgactgaa 60 
ctttacatta ttatggtttc ggctatgcgg 120 
atgcaaaagc aggtagtgtg ccgtccctgg 180 
gcctgggtgc ctaccagctg tctcaggacc 240 
ctg 273 



<210> 93 
<211> 262 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701532848H1 

<400> 93 

cngccgtcnt cctccagcgc angcntccgg gctccagctc cggtgttggg tgcaggcctg 60 
gtgtggtctc caaagtgact gaacaatgca gaaggacagt ggcncactgg ttcctttaca 120 
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ttattatggt ttcggctatg cggccctggt ggctactggt gggattattg gctatgcaaa 180 
agcaggtagt gtgccgtccc tggctgctgg actcttcttt gggggcctgg caggcctggg 240 
tgcctaccag ctgtctcagg ac 262 



<210> 94 
<211> 247 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700181220H1 



<400> 94 



aaaacgtcct gtaactgcgg ttccgggagt 
cgccacggaa accgtgcagg cctggtgtgg 
cagtggccca ctggttcctt tacattatta 
tggtgggatt attggctatg caaaagcagg 
ctttggg 



ggaaacctaa acgcgcgtgc gcttcttcca 60 
tctccaaagt gactgaacaa tgcagaagga 120 
tggtttcggc tatgcggccc tggtggctac 180 
tagtgtgccg tccctggctg ctggactctt 240 

247 



<210> 95 
<211> 284 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_ feature 

<223> Incyte ID No.: 701462707H1 

<400> 95 



tacacacccg gctcctgacc tctgttcctg tgctcccgcc gtcgtcctcc agcgcaggcc 60 
tccgggctcc agctccggtg ttgggtgcan gcctggtgtg gtctccaaag tgactgaaca 120 
atgcagaagg acagtggccc actggttcct ttacattatt atggtttcgg ctatgcggcc 180 
ctggtggcta ctggtgggat tattggctat gcaaaagcag gtagtgtgcc gtccctggct 240 
gctggactct tctttggggg cctggcaggc ctgggtgcct acca 284 



<210> 96 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701462863H1 



<400> 96 



tacacacccg gctcctgacc tctgttcctg tgctcccgcc gtcgtcctcc agcgcaggcc 60 
tccgggctcc agctccggtg ttgggtgcag gcctggtgtg gtctccaaag tgactgaaca 120 
atgcagaagg acagtggccc actggttcct ttacattatt atggtttcgg ctatgcggcc 180 
ctggtggcta ctggtgggat tattggctat gcaaaagcag gtagtgtgcc gtccctggct 240 
gctggactct tctttggggg cctggcaggc ctgggtgcct ac 282 



<210> 97 
<211> 281 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> incyte ID No.: 701481465H1 



40/52 



WO 01/36684 



PCT/US00/31743 



<400> 97 

ttcttaactg ctccgacctc tcctccacag gtgcaggcct ggtgtggtct ccaaagtgac 60 

tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg tttcggctat 120 

gcggccctgg tggctactgg tgggattatt ggctatgcaa aagcaggtag tgtgccgtcc 180 

ctgggctgct ggactcttct ttgggggcct ggcaggcctg ggtgcctacc agctgtctca 240 

ggaccccagg aacgtgtggg ttttcctagc tacgtctggg a 281 

<210> 98 
<211> 265 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701308467H1 

<400> 98 

tgttcctgtg ctcccgccgt cgtcctccag cgcaggcctc cgggctccag ctccggngtt 60 

gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat gcagaaggac agtggcccac 12 0 

tggttccttt acattattat ggtttcggct atgcggccct ggtggctact ggtgggatta 180 

ttggctatgc aaaagcaggt agtgtgccgt ccctggctgc tggactcttc tttgggggcc 240 

tgnagnctgg gtgcctacca gctgt 265 



<210> 99 
<211> 291 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_f eature 
<223> Incyte ID No. 



701564368H1 



<400> 99 

gggggcctgg caggcctggg tgcctaccag ctgtctcagg accccaggaa cgtgtgggtt 60 

ttcctagcta cgtctgggac tttggctggc attatgggga tgagattcta caactctggg 120 

aaatttatgc ctgcaggttt gatcgcggga gccagtttgc tgatggttgc caaacttgga 180 

cttagtatgt tgagttcacc ccatccgtag tagccatagt cctgcgtggg ctcatgatga 240 

gttgacactc tccagtcctc cacattacca cgctgaagag ataagaacag c 291 



<210> 100 
<211> 271 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> mi sc__f eature 
<223> Incyte ID No. 



700533180H1 



<400> 100 

caggtagtgt gccgtccctg gctgctggac tcttctttgg gggcctggca ggcctgggtg 60 

gcctaccagc tgtcctcagg aaccccagga acgtgtgggt tttcctagct acgtctggga 120 

ctttggctgg cattatgggg atgagattct acaactctgg gaaatttatg cctgcaggtt 180 

tgatcgcggg agccagtttg ctgatggttg ccaaacttgg acttagtatg ttgagttcac 240 

cccatccgta gtagccatag tcctgcgtgg g 271 



<210> 101 
<211> 255 
<212> DNA 

<213> Rattus norvegicus 
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<220> 

<221> misc_feature 

<223> Incyte ID No.: 700124647H1 

<400> 101 



ccgtcgtcct ccagcncagg cctccgggct 
tggtctccaa agtgactgaa caatgcagaa 
ttatggtttc ggctatgcgg ccctggtggc 
aggtagtgtg ccgtccctgg ctgctggatc 
tannagctgt ctcaa 



ccagctccgg tgttgggtgc aggcctggtg 60 
ngacagtggc ccactggttc ctttacatta 120 
tactggtggg attattggct atgcaaaagc 180 
ttctttgggg gcctggcagg cctgggtgcc 240 

255 



<210> 102 
<211> 297 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700537020H1 



<400> 102 



gccctggtgg ctactggtgg gattattggc 
gctgcnggac tcttctttgg gggcctggca 
ccccaggaac gtgtgggttt tcctagctac 
agattctaca actctgggaa atttatgcct 
atggttgcca aacttggact tagtatgttg 



tatgcaaaag caggtagtgt gccgtccctg 60 
ggcctgggtg cctacnagct aggctcagga 120 
tctggaccnt nggctggcat tatggggatg 180 
gcaggtttga tcgcgggagc cagtttgctg 240 
agttcacccc atccgtagta gccatag 297 



<210> 103 
<211> 261 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700765205H1 



<400> 103 



gacctctgtt cctgtgctcc cgccgtcgtc ctccagcgca ggcctccggg ctccagctcc 60 

ggtgttgggt gcaggcctgg tgtggtctcc aaagtgactg aacaatgcag aaggacagtg 120 

gcccactggt tcctttacat tattatggtt tcggctatgc ggccctggtg gctactggtg 180 

ggattattgg ctatgcaaaa gcaggtagtg tgccgtccct ggctgctgga ctcttctttg 240 

ggggcctggc aggctgggtg c 261 



<210> 104 
<211> 312 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701942992H1 



<400> 104 



cgacgtctac ncacccggct cctgacctct 
gcaggcctcc gggctccagc tccggtgttg 
ctgaacaatg cagaaggaca gtggcccact 
tgcggccctg gtggctactg gtgggattat 
cctggctgct ggactcttct ttgggggcct 
ncccaggnan nt 



gttcctgtgc tcccgccgtc gtcctccagc 60 
ggtgcaggcc tggtgtggtc tccaaagtna 120 
ggttccttta cattattatg gtttcggcta 180 
tggctatgca aaagcaggta gtgtgccgtc 240 
ggcagcctgg ggcctacaag ttttntcagg 300 

312 
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<210> 105 
<211> 241 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701197694H1 

<400> 105 

tgctcccgcc gtcgtcctcc agcgcaggcc 
gcctggtgtg gtctccaaag tgactgaaca 
ttacattatt atggtttcgg ctatgcggcc 
gcaaaagcag gaacgtgtgg gttttcctag 

g 



tccgggctcc agctccggtg ttgggtgcag 60 
atgcagaagg acagtggccc actggttcct 120 
ctggtggcta ctggtgggat tattggctat 180 
ctacgtctgg gactttggct ggnattatgg 240 

241 



<210> 106 
<211> 268 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701024952H1 

<400> 106 

cccggctcct gacctctgtt cctgtgctcc 
ctccagctcc ggtgttgggt gcaggcctgg 
aaggncagtg gcccactggt tcctttacat 
gctactggtg ggattattgg ctatgcaaaa 
ctctnctttn ggggcctggc aggcttag 



cgccgtcgtc ctccagcgca ggcctccggg 60 
tgtggtctcc aaagtgactg nacaatgcag 120 
tattatggtt tcggctatgc ggccctggtg 180 
gcaggtagtg tgccgtccct ggctgctgga 240 

268 



<210> 107 
<211> 318 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701582676H1 

<400> 107 

gcctaccagc tgtctcagga ccccaggaac 
ttggctggca ttatggggat gagattctac 
atcgcgggag ccagtttgct gatggttgcc 
catccgtagt agccatagtc ctgcgtgggc 
acattaccac gctgaagaga taagaacagc 
gacgtggtta ctatagtg 



gtgtgggttt tcctagctac gtctgggact 60 
aactctggga aatttatgcc tgcaggtttg 120 
aaacntggac ttagtatgtt gagttcaccc 180 
tcatgatgag ttgacactct ccagtcctcc 240 
aaagacctac actgagcaca tggaggcgaa 3 00 

318 



<210> 108 
<211> 255 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701293154H1 

<400> 108 

ggattattgg ctattgcaaa agcaggtaag tgtgccgtcc ctggctgctg gactcttctt 60 
tgggggcctg gcaggcctgg gtgcctacca gctgtctcag gaccccagga acgtgtgggt 120 
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tttcctagct acgtctggga ctttggcttg cattatgggg atgagattct acaactctgg 180 
gaaatttatg cctgcaggtt tgatcgcggg agccagtttg ctgatggttg ccaaacttgg 240 
attagtatgt tgagg 255 



<210> 109 
<211> 254 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701298824H1 

<400> 109 

catgcgcagg cctccgggct ccatgctccg 
aaagngactg aacaatgcag aaggacagtg 
tcggctatgc ggccctggtg gctactggtg 
tgccgccctg gctgctggac tcttctttgg 
ctcaggaccc agga 



gtgttgggtg catggcctgg tgnggtctcc 60 
gcccactggt tcctttacat tattatggnt 120 
ggattattgg ctatgcaaaa gcnggtagtg 180 
gggcctgcag nctgggtgcc taccagctgt 240 

254 



<210> 110 
<211> 294 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700524204H1 

<400> 110 

tcaggacccc aggaacgtgt gggttttcct agctacgtct gggactttgg ctggcattat 60 

ggggatgaga ttctacaact ctgggaaatt tatgcctgca ggtttgatcg cgggagccag 120 

tttgctgatg gttgccaaac ttggacttag tatgttgagt tcaccccatc cgtagtagcc 180 

atagccctgc gtgggctcat gatgagttga cactctccag tcctctacat taccacgctg 240 

aagagataag aacagcaaag acctacactg agcacatgga ggcgaagagt ggtt 294 



<210> 111 
<211> 289 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700067537H1 

<400> 111 

gacgtctaca cacccggctc ctgacctctg ttcctgtgct cccgccgtcg tcctccagcg 60 

caggcctccg ggctccagct ccgctgttgg gtgcaggcct ggtgtggtct ccaaagtgac 120 

tgaacaatgc agaaggacag tggcccactg gttcctttac attattatgg tttcggctat 180 

gcggccctgg tggctactgg tgggattatt ggctatgcaa aagcagtagt gtgccgtccc 240 

tggctgctgg atcttctttg ggggctggca ggctgggtgc ctacaactg 289 



<210> 112 
<211> 276 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701258019H1 
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<400> 112 



tgttcctgtg ctcccgccgt cgtcctccag cgcaggcctc cgggctccag ctccggtgtt 60 

gggtgcaggc ctggtgtggt ctccaaagtg actgaacaat gcatgaagga cagttggccc 120 

actggttcct ttacattatt atggnttccg gctatgcggc cctggtggct actggtgnga 180 

ttattggcta tgcaaaagca ggtagtgtgc cgccctggct gctggactct tctttggggg 240 

cctgcagnct ggtgcctacc agctgctctg cgtngg 27 6 



<210> 113 
<211> 254 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 700532493H1 



<400> 113 



tcangacccc aggaacgtgt gggttttcct 
ggggctgaga ttctacaact ctgggaaatt 
tttgctgatg gttgccaaac ttggacttag 
atagccctgc gtgggctcat gatgagttgc 
gagatanaac age a 



agetaegtet gggactttgg cnggcattat 60 
tatgectgea ggtttgatcg egggagecag 120 
tatgttgagt tcaccccatc egtagtagee 180 
atctccagtc ctctacatta ccacgctgaa 240 

254 



<210> 114 
<211> 282 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700523302H1 



<400> 114 



ctccagcnca ggcctccggg ctccagctcc 
aaagtgactg aacaatgeag aaggacagtg 
gggtanangn ncccgnnnng nnaagggggn 
ccgtccctgg ctgctggact cttctttggg 
tctcaggacc ecaggaaegg tgggtttccn 



ggtgttgggt geaggectgg tgtggtctcc 60 
gcccactggt tcctttanat aatnatggtc 120 
atnttgnnnt acgnaagagc ngntagtgtg 180 
ggcctggcag gcctgggtgc ctaccagctg 24 0 
agctacgncg gg 282 



<210> 115 
<211> 256 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> mis cofeature 

<223> Incyte ID No.: 701242719H1 



<400> 115 



cacacccggc tcctgacctc tgttcctgtg 
cgggctccag ctccgntgtt gggtgcaggc 
gcagaaggac agtggcccac tggttccttt 
ggnggctact ggtgggatta ttggctatca 
actcttcttt ggggee 



ctcccgccgn cgncctccag cgcaggcctc 60 
ctggtgtggt ctccaaagtg actgaacaat 120 
acattattat ggtttcggct atgcggccct 180 
aaagcaggta gtgtgccgcc ctggctgtgg 240 

256 



<210> 116 
<211> 244 
<212> DNA 

<213> Rattus norvegicus 
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<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701226025H1 

<400> 116 



cattattatg gtttcggcta tgcggccctg 

aaagcaggta gtgtgccgcc ctggctgctg 

tgcctaccag ctgctcagga ccccaggaac 

ttgctggcat tatggggatg agattctaca 
cgcg 



gtggctactg gtgggattat tggctatgca 60 
nctcttcttt ggaggcctgg caggcctggg 120 
gtgtgggttt tcctagctac gtctgggact 180 
actctgggaa atttatcctg caggtttgat 240 

244 



<210> 117 
<211> 262 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 701293276H1 



<400> 117 



cgtctacaca cccggctcct gacctctgtt 
aggcctccgg gctccagctc cggtgttggg 
gaacaatgca gaaggacagt ggcccactgg 
ggcccttggt ggctactggt gggattattg 
tggctgtgga ctctctntgn gg 



cctgtgctcc cgcccgtcgt cctccagcgc 60 
tgcaggcctg gngtggtctc caaagtgact 120 
ttcctttaca ttattatggt ttcggctatc 180 
gctatgcaaa agcaggtagt gtgccgtccc 240 

262 



<210> 118 
<211> 261 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700493358H1 



<400> 118 



caggcctggt gtggtctcca aagtgactga 

cctttacatt attatggttt cggctatgcg 

tatgcaaaag caggtagtgt gccgtccctg 

ggcctgggtn canacnantg tctaggnccc 

ggnnttggnc canaaangga a 



acaatgcaga aggacagtgg cccactggtt 60 
gccctggtgg ctactggtgg gattattggc 120 
gctgctggac tcttctttgg ggncntggca 180 
caagaaangt gggttnccca aannaggggg 240 

261 



<210> 119 
<211> 265 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700533285H1 



<400> 119 



ccttgaactc atttcttcct gactgctaga 
tcctccacag gtgcaggcct ggtgtggtct 
tggcccactg gttcctttac attattatgg 
tgggattatt ggtatgcaaa agcaggtagt 
gggggcctgg caggcctggg tgcct 



ggcctgtgtg ttcttaactg ctccgacctc 60 
ccaaagtgac tgaacaatgc agaaggacag 120 
tttcggctat gcggccctgg tggctactgg 180 
gtgccgtccc tggctgctgg actcttcttt 240 

265 
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<210> 120 
<211> 247 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700920823H1 



<400> 120 



cgtgnacgtc tacacacccg gctcctgacc tctgttcctg tgctcccgcc gtcgtcctcc 60 

agcgcaggcc tcccgggctc cagctccggt gttgggtgca ggcctggtgt ggtctccaaa 120 

gtgactgaac aatgcagaag gacagtggcc cactggttcc tttacattat tatggtttcg 180 

gctatgcggc cctggtggct actggtggga ttattgctat gcaaaagcag gtagtctgcc 240 

gctccct 247 



<210> 121 
<211> 263 
<212> DNA 

<213> Rattus norvegicus 



<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700627607H1 



<400> 121 



gacgtctaca cacccggctc ctgacctctg 

caggcctccg ggctccagct ccggtgttgg 

tgaacaatgc agaaggacag tggcccactg 

gcggccctgg tggctactgg tgggattatt 

ggcnanctcg nnccgaggng nnc 



ttcctgtgct cccgccgtcg tcctccagcg 60 
gtgcaggcct ggtgtggtct ccaaagtgac 120 
gttcctttac attattatgg tttcggctat 180 
ggctatgcaa anccagntat cgccggcncn 240 

263 



<210> 122 
<211> 265 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> mi sc_f eature 

<223> Incyte ID No.: 700437944H2 

<400> 122 



ctccgntgtt gggtgcaggc ctggtgtant 
cantggccca ctggttcctt tacattattn 
tgntgggatt attggctatn caaaagcagg 
ctttgggggc ctgacaggct gggtgcctac 
ngttttccta agctacntct gggac 



ctccaaagtg actgaacaat gaagcaggac 60 
tngtttcggc tatncggccc tgntngctac 120 
tagtgtnccg tccctggctg ctggactctt 180 
cagctgtctc angcacccca ggaacgtgtg 240 

265 



<210> 123 
<211> 343 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_f eature 

0<223> Incyte ID No.: 701582848H1 



<400> 123 



gctaccagct gtctcaggac ccaggaacgt gtgggtttcc tagctacgtc tgggactttg 60 
gctggcatta tggggatgag attctacnac tctgggaaat ttatgcctgc aggtttgatc 120 
gcgggagcca nttgctgata gttgccaact tngacttagt atgttgagtn caccccatcc 180 
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gtagtagcat ancctgcgtg ggctcagatg agtnacactc tccaggcctc cacatttacc 240 
aggctgaaga gtaagacagc aaagactaca tgagcacntg aggnaaacgt ggttntatat 300 
gacgttcaag acgcgatgnt gactcagact ncntgctcat egg 343 



<210> 124 
<211> 241 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701305531H1 



<400> 124 



gaegtctaca cacncggctc ctgacctctg 
caggcctccg ggctccagct ccggagttgg 
tgaacaatgc agaaggacag tggcccactg 
gcggccctgg tggctactgg tggattattg 

g 



ttcctgngct cccgccgncg acctccagcg 60 
gtgeaggect ggngtgnnct ccaaagtgac 120 
gttcctttac attattatgg atteggctat 180 
gctatcaaaa gcaggagtgt ccgccctgct 240 

241 



<210> 125 
<211> 155 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 700916103H1 



<400> 125 



gtgctcccgc cgtcgtcctc cagcgcaggc ctccgggctc cagctccggt gttgggtgca 60 
ggcctggtgt ggtctccaaa gtgactgaac aatgcagaag gacagtggcc cactggttcc 120 
tttacattat tatggtttcg getatgegge cctgg 155 



<210> 126 
<211> 185 
<212> DNA 

<213> Rattus norvegicus 
*220> 

<221> misc_feature 

<223> Incyte ID No.: 701294764H1 

<400> 126 

ccgccgtcgt ccttcagcgc aaggnctccg 
ggagtggnct ccaaagtgac tgaacaatgc 
attattatgg ttteggctat gcggccctgg 
agegg 



ggctccagct ccggagttgg gngcaggect 60 
agaaggacan tggcccactg gntcctttac 120 
aggenactgg gggnatattg getatneaaa 180 

185 



<210> 127 
<211> 125 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc„f eature 

<223> Incyte ID No.: 700066710H1 

<400> 127 

ctcttctttg ggggectgne caggctgggt gcctaccagc tgtctcagga ccccaggaac 60 
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gtgtgggttt tcctagctac gtctgggact ttggctggca ttatggggat gagattctac 120 
aactc 125 



<210> 128 
<211> 266 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 701471559H1 



<400> 128 



tttatgcctg cnggtttgat cgcgggagcc 
agtatgttga gttcacccca tccgtagtag 
gacactctcc agtcctctac attaccacgc 
tgagcacatg gaggcgaaga cgtggttact 
gacctcagag ctcacactgc cttcat 



agtttgctga tggttgccaa acttggactt 60 
ccatagccct gcgtgggctc atgatgagtt 120 
tgaagagata agaacagcaa agacctacac 180 
atagtgaccg ttcagagntg gcgagtgtct 240 

266 



<210> 129 
<211> 208 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700325006H1 



<400> 129 



ggcaggcctg ggtgcctacc agctgtctca 
ctacgtctgt gactttggct gancattatt 
ttatgcctgc aggtttnatc gcggncancc 
tagtangntn anttcacccc ntgccgtc 



ggacnccagg nacgtgtggg ntttcctaga 60 
ngggatgana ttctaacaac tctgggaaat 120 
agtttgnntg atggttgcca aacttggact 180 

208 



<210> 130 
<211> 263 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 701258479H1 



<400> 130 



gcagagctag ggcgagcaag tggctgtgtg ttcaagggcc agttgcatcc gcacccagtg 60 

cttgtacctt gaactcattt cttcctgact gctagaggcc tgtgtgttct taactgctcc 120 

gacctctcct ccacaggtgc aggcctggtg tggnctccaa agtgactgaa caatgcagaa 180 

ggacagtggc ccactggctc ctttacatta ttatggnttc ggctatgcgg cctggtggct 240 

actggnggna ttattggcta tgc 263 



<210> 131 
<211> 258 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> mis cofeature 

<223> Incyte ID No.: 700627187H1 
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<400> 131 



aatttatgcc tgcaggttga tcgcnggagc 
taggatgttg agttcacccc atcccggagt 
ttgacactct ccagtcctcc acattaccac 
actgagcaca tggaggcgaa gacgtggtta 
ctgactcaga gctcacac 



cagtttgctg atggttgcca aacttngact 60 
agccatagtc ctgcgtgggc tcatgatgag 120 
gctgaagaga taagaacagc aaagacctac 180 
ctatagtgac cgttcagaga cggcgagtgt 240 

258 



<210> 132 
<211> 272 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_f eature 

<223> Incyte ID No.: 701246066H1 



<400> 132 



gcgggagcca gtttgctgat ggntgccaaa cttggactta gnatgttgag ntcaccccnt 60 
ncgtagtagc catagtcctg cgtggtctca tgatgagttg acactctcca gtcctncaca 120 
Ottaccacgct gaagagatan gaacagcaaa gacctacact gagcacatgg aggcgaagac 180 
gtggttacta tagtgaccgt tcagagacgg cgagtgtctg acctcagagc tcacactgct 240 
tcatgcggct tgntcttgtg catgatgctc ng 272 



<210> 133 
<211> 253 
<212> DNA 
<213> Rattus 



norvegicus 



<220> 

<221> misc_feature 

<223> Incyte ID No.: 700594190H1 



<400> 133 



atccgtagta gccatagccc tgcgtgggct 
cattaccacg ctgaagagat aagaacagca 
acgtggttac tatagtgacc gttcagagac 
ccttcatgcg gcttgttctt gtgtcatgat 
ggtaaaatgt tgg 



catgatgagt tgacactctc cagtcctcta 60 
aagacctaca ctgagcacat ggaggcgaag 120 
ggcgagtgtc tgacctcaga gctcacactg 180 
gtctcgactc tctgtactac tacataaagg 240 

253 



<210> 134 
<211> 267 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 700627108H1 



<400> 134 



gaattgatnc ctggcaggtt gatcgcggga 
gncttngtat ctgagttcaa cccnatcggt 
tgnnttgaac actctccagt cagtccagat 
aagacctaca ctgagcacca tggaggcgaa 
acggcgngtg tntggatcan agatcca 



gccagttttg ctgatggttg acaaactttg 60 
agtagccata agtctanccn gggntcatga 120 
naacgncgct gntagagatn aagaccagcn 180 
gacgtggtta ctataagtga ccgttcagag 240 

267 



<210> 135 
<211> 650 
<212> DNA 
<213> Rattus 



norvegicus 
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<220> 

<223> RnAUG.conN 
<400> 135 

gtgctcccgc cgtcgtcctc cagcgcaggc ctccgggctc cagctccggt gttgggtgtg 60 
ttcttacttt gcggatttta ccaccctgga attgttccgt acgcgcaggc gcgcgggcgc 120 
tctcccgtgc actctctgct gagctagcgg actgcccgcc tctctaaaac gtcctgtaac 180 
tgcggttccg ggagtggaaa cctaaacgcg cgtgcgcttc ttccacgcca cggaaaccgt 240 
gcaggcctgg tgtggtctcc aaagtgactg aacaatgcag aaggacagtg gcccactggt 300 
tcctttacat tattatggtt tcggctatgc ggccctggtg gctactggtg ggattattgg 360 
ctatgcaaaa gcaggtagtg tgccgtccct ggctgctgga ctcttctttg ggggcctggc 420 
aggcctgggt gcctaccagc tgtctcagga ccccaggaac gtgtgggttt tcctagctac 480 
gtctgggact ttggctggca ttatggggat gagattctac aactctggga aatttatgcc 540 
tgcaggtttg atcgcgggag ccagtttgct gatggttgcc aaacttggac ttagtatgtt 600 
gagttcaccc catccgtagt agccatagcc ctgcgtgggc tcatgatgag 650 



<210> 136 
<211> 114 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<223> RnAUG.conP 



<400> 136 



Met 


Gin 


Lys Asp 


Ser 


Gly 


Pro 


Leu 


Val 


Pro 


Leu 


His 


Tyr 


Tyr 


Gly 


1 






5 










10 










15 


Phe 


Gly 


Tyr Ala 


Ala 


Leu 


Val 


Ala 


Thr 


Gly 


Gly 


He 


He 


Gly 


Tyr 








20 










25 










30 


Ala 


Lys 


Ala Gly 


Ser 


Val 


Pro 


Ser 


Leu 


Ala 


Ala 


Gly 


Leu 


Phe 


Phe 




35 










40 










45 


Gly Gly 


Leu Ala 


Gly 


Leu 


Gly 


Ala 


Tyr 


Gin 


Leu 


Ser 


Gin 


Asp 


Pro 








50 










55 










60 


Arg 


Asn 


Val Trp 


Val 


Phe 


Leu 


Ala 


Thr 


Ser 


Gly 


Thr 


Leu 


Ala 


Gly 




65 










70 










75 


He 


Met 


Gly Met 


Arg 


Phe 


Tyr 


Asn 


Ser 


Gly 


Lys 


Phe 


Met 


Pro 


Ala 








80 










85 










90 


Gly 


Leu 


He Ala 


Gly 


Ala 


Ser 


Leu 


Leu 


Met 


Val 


Ala 


Lys 


Leu 


Gly 






95 










100 










105 


Leu 


Ser 


Met Leu 


Ser 


Ser 


Pro 


His 


Pro 















110 



<210> 137 
<211> 223 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No.: 746355H1 

<400> 137 

ctacgcagca ctggttgctt ctggtgggat 

gtccctggct gcagggctgc tctttggcag 

tcaggatcca aggaacgttt gggttttcct 

gggaatgagg ttctaccact ctggaaaatt 



cattggctat gtaaaagcag gcagcgtgcc 60 

tctagccggc ctgggtgctt accagctgtc 120 

agctacatct ggtaccttgg ctggcattat 180 

catgcctgca ggt 223 



<210> 138 
<211> 243 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No.: 1294663H1 

<400> 138 

ggaaaattca tgcctgtagg tttaattgca ggtgccagtt tgctgatggc cgccaaagtt 60 

ggagttcgta tgttgatgac atctgattag cagaagtcat gttccagctt ggactcatga 120 

aggattaaaa atctgcatct tccactattt tcaatgtatt aagagaaata agtgcagcat 180 

ttttgcatct gacattttac ctaaaaaaaa aaagacacca aatttggcgg aggggtggaa 240 

aat 0 243 
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